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(57) Abstract: The present 

invention is related to mammary 
tumor virus (MTV). MTV represents 
a group of retroviruses which possess 
very high homology to mouse 
mammary tumor virus (MMTV), 
a virus known to cause neoplastic 
mammary disease in mice. As 
described herein, MTV's have been 
identified in human, cat, and Rhesus 
macaque. The present invention 
specifically provides for recombinant 
nucleic acids and polypeptides 
derived from these MTV's as well 
as methods for using these biological 
molecules. 
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HUMAN ENDOGENOUS RETROVIRUS IN BREAST CANCER 

BACKGROUND OF THE INVENTION 

The present application takes priority from Provisional U.S. patent application number 
60/141,626 filed on June 30, 1999 the entire text of which is specifically incorporated herein by 
reference. 

1. Field of the Invention 

The present invention relates to compositions and methods for the detection, prognostic 
evaluation, and treatment of oncogenic disorders, particularly breast cancer. 

Specifically, the instant invention provides for compositions, useful for identifying and 
treatmg borders related to newly identified endogenous retroviruses which are present in a 
subset of humans, cats, and nonhuman primates. 

2. Technical Problem Addressed by the Invention 

Mutations in known susceptibility genes do not account for all breast cancer 

Breast cancer (BC) is one of the leading causes of cancer death among women The 
mduction of BC is thought to involve the mterplay of several factors, including the genetic 
hormonal, immunological and physiological status of the host, as well as dietary habits and 
exposures to chemicals, radiation or infectious agents. It is now clear that variations of several 
genes, including BRCA-1 and BRCA-2, can result in greatly increased risks for development of 
BC. However, defects in known BC susceptibility genes account for only about 5% of BC, the 
so-called familial cases (Armstrong et al, 2000; Gayther et ai, 1998). 

As with other types of cancer, the possibility that a virus is etiological^ involved in 
sporadically occurring BC has not been eliminated. Consequently, there has long been a need to 
determine winch, if any viruses are causally linked to the development of BC. The identification 
of such a virus would likely provide invaluable aid in the following areas BC medicine: 
prevention, diagnosis, determination of prognosis, and treatment. 
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3. Description of Related Art 
Retrovirus induction of breast cancer in mice 

Mouse mammary tumor virus (MMTV), a B-type retrovirus, was discovered during 
studies of hereditary cancer in mice at the Jackson Laboratories in the 1930's (Bittner, 1936). As 
5 the prototype of slow-transforming retroviruses, MMTV has been definitively shown to cause 
BC in mice. Prior studies established that MMTV is transmitted both in the germline as 
endogenous proviruses and exogenously via milk. As endogenous elements MMTV proviruses 
follow patterns of Mendelian inheritance, as other sequences in the genome (Cohen et al, 
Cell, 1979; Cohen and Varmus, 1979, 1980; Traina et al., 1981; Traina-Dorge and Cohen, 1983; 
io Traina-Dorge et al, 1985; Varmus et al., 1978). Horizontal transmission of MMTV typically 
occurs by infection of mouse pups by MMTV virions present in the milk of infected dams. 
Thus, it is possible to transmit MMTV to mice by foster feeding. 30 or more unique proviral^ 
integration sites for endogenous MMTV have been identified. However, some wild mice do not' 
carry any endogenous MMTV proviruses (Cohen and Varmus, 1979; Cohen et al, 1982). This 
15 result suggests that the many endogenous MMTV proviruses are relatively recent additions to the 
mouse genome. The most likely explanation is that MMTV entered the germline of certain mice 
(but not others) on multiple occasions after the evolutionary splits among the various species and 
subspecies of the genus Mus. Certain endogenous MMTV can be activated by hormones to form 
infectious virions capable of inducing mammary carcinomas after long latency periods. Most 
20 endogenous MMTV proviruses are defective and do not encode for infectious virions. 

Roles of MMTV genes and cellular genes in oncogenesis 

The MMTV Orf protein can function as a superantigen (SA). When expressed in the 
thymus during fetal/early development it can mediate complete or incomplete deletion of SA- 
reactive T-cells. SA expression is required to activate B-cells targets of MMTV in the 
gut-associated lymphoid tissue of nursing pups. Complete deletion of the SA responsive clones 
thus renders the mice resistant to MMTV infection in the gut and thereby leads to a low 
incidence of MMTV-induced tumors. On the other hand, in mice with only partially deleted 
responsive clones of lymphoid cells the SA activation stimulates expansion of the targets and 
spread of MMTV. As female infected animals reach puberty, estrogenic hormones drive 
30 expression of the MMTV long terminal repeat (LTR) through its hormone response element 



25 
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(HRE). This permits production and assembly of MMTV and spread of the virus to other 
hormonally-sensitive tissues, including the breast and ovaries. Integration of MMTV LTRs 
adjacent to certain cellular genes, such as the proto-oncogenes Int, Wnt and Fgf, can increase 
expression of these genes resulting in BC and other cancers. 

5 The molecular genetic interactions between MMTV, the immune system of its murine 

host, and the breast and other hormonally-sensitive cells malignantly transformed by this 
retrovirus have been extensively studied. MMTV promotes mammary gland cancer in mice by 
insertional mutagenesis (Varmus et al. t 1978; Varmus, 1985). MMTV proviral LTR elements 
direct steroid hormone-dependent transactivation of various cellular oncogenes including Wnt, 

10 Fgf and Int thereby promoting clonal expansion of tumor cells (Shackleford and Varmus, 1987, 
Shackleford et al, 1993; Jakobovits et aL, 1986; Nusse, 1991; Nusse et aL, 1985). For 
productive, persistent infection and completion of its replication cycle, MMTV must contain a 
superantigen and interact with a functional host immune system (Golovkina et aL, 1995; Luther 
and Acha-Orbea, 1996; Coffin, 1992). 

15 The search for a human breast cancer virus 

The discovery of the oncogenic MMTV has prompted many investigators to explore a 
retroviral etiology for BC in humans (Sarkar, 1980). Data collected over the past five decades 
has suggested the existence of a human homologue of MMTV. In 1971, Moore and associates 
reported that 60% of human milk samples from BC patients contain B-type panicles 

20 indistinguishable from MMTV by electron microscopy, compared to 5% of the general 
population (Moore et aL, 1971). These investigators also reported that 39% of Parsi women of 
India, an inbred population with a two-fold increased incidence of BC, had B-type particles in 
their milk (Das et aL, 1972; Moore, 1971). Several studies have demonstrated that BC cells, but 
not cells from normal tissues, also contain reverse transcriptase (RT), an enzyme associated with 

25 all retroviruses. Numerous investigators have examined serum and breast milk for the presence 
of antibodies reactive with MMTV. Most of these studies were performed in the pre- AIDS era, 
prior to the advent of highly sensitive and specific techniques, for detecting anti-retroviral 
antibodies made necessary for detection of HIV antibodies in donated blood. 

Despite the numerous electron microscopic, biochemical and immunological studies on 

30 human breast carcinoma tissue, milk, patients' sera, and breast carcinoma cell lines suggesting 
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the existence of a human homologue of MMTV, proof that such an agent exists has remained 
elusive (Andersson et al, 1996; Ziegler, 1997). Most authors have dismissed the importance of 
prior studies purporting to show evidence of a human homologue of MMTV because of the 
presence of numerous human endogenous retroviruses (HERVs) (Larsson et al, 1994; Li et al, 

s 1996; Lower et al, 1996; Meese et al, 1996; Ono, 1986; Patience et al, 1996; Faff et al, 1992). 
There are about 50,000 HERVs or HERV-related sequences in the human genome, some of 
which have been shown to have up to 60% homology to MMTV. In this regard, it is important 
to note that seroreactivity to HERV-K10, to this point the HERV considered to be most closely 
related to MMTV, cannot account for MMTV-reactive antibodies present in the sera of breast 

10 cancer patients and the smaller number of healthy individuals (Vogetseder et al, 1995). 
Furthermore, we believe that the presence of these MMTV-related sequences is precisely the 
reason that human homologues of MMTV have not previously been demonstrated conclusively '' 
by molecular techniques. The presence of these related, but distinct, sequences could have 
obscured the detection of more closely related sequences by prior investigators who used less 

15 sensitive techniques, such as Southern blotting. 

Only recently have sequences with relatively high homology (>90%) to those of MMTV 
been isolated from human BC tissue (Wang et al, 1995, 1998; ). Sequences 95 - 99% similar to 
MMTV env were amplified by PCR in 121 (38.5%) of 314 unselected breast cancer tumor 
samples. It is pertinent to note that the MMTV-like sequences were detected in only 2 (1 .8%) of 

20 107 breast specimens from reduction mammoplasties and in 0/80 samples from normal tissues or 
non-breast tumors. The MMTV-e/?v like RNA was expressed (as determined by RT-PCR) in 
66% of DNA PCR positive breast tumors (Wang et al, 1998). A complete 9.9 kb provirus with 
94% similarity to MMTV was detected in 2 breast tumors. FISH (fluorescence in situ 
hybridization) revealed integration at several sites in DNA derived from BC tumors, but not 

25 normal breast cells (Wang et al., 1999 ACR mtg. abstracts #2933, 2944). Wang et al suggested 
the existence of a human mammary tumor virus (HMTV) that is spread by the exogenous route 
of infection (horizontal transmission). Attempts by these and other investigators to amplify other 
regions of MMTV-related viruses, from the genomic DNA or cDNA of subjects who did not 
have BC, yielded HERV sequences (such as HERV-K10) with only about 60% homology to 

30 MMTV. Thus, BC tissues are the only tissues in which sequences that are highly similar to those 



DCIO <WO 0100829A2_I_> 



WO 01/00829 



PCT/US00/18279 



-5- 

of MMTV have been heretofore found. Consequently normal breast tissue and other tissues 
appeared to be negative for the expression of viruses with high homology to MMTV. 

MMTV can be transmitted horizontally or vertically 

The retrovirus replication cycle is characterized by conversion of the single-stranded 

5 RNA viral genome into doubled-stranded proviraJ DNA by the multiple enzymatic activities of 
the virion-associated reverse transciptase (RT). As is the case with other retroviruses, 
integration of MMTV proviral DNA into the genome of host cells is required for expression of 
viral proteins and production of infectious progeny. In both infected mouse mammary glands as 
well as heterologous cells MMTV proviral DNA is integrated into a large number of apparently 

10 random sites. Integration of MMTV proviruses containing transcriptionally active LTR near 
some cellular genes (proto-oncogenes), such as Int, Wnt, and Fgf can result in over-expression of 
these genes, cellular transformation and clonal expansion of the tumor cells (Varmus, 1985; 
Shackleford et a!., 1993; Jakobovits et al, 1986; Cohen, 1980; Breznik and Cohen, 1982). The 
long latency of MMTV-induce carcinogenesis is explained in part by the necessity for proviruses 

15 to integrate into these particular sites. 

Genetic differences among viral strains of MMTV can account in part for the varying 
incidence of BC in diverse strains of mice. Mice of the C3H strain have a greater than 90% 
incidence of BC, compared to a <1% incidence of BC in BALB/c mice. BALB/c mice foster- 
nursed on C3H females have a high incidence of BC which suggested that the tumorigenic 

20 MMTV of C3H can be horizontally transmitted in the milk (Bittner, 1936). Conversely, when 
C3H mice are foster nursed on a BALB/c female the incidence of BC is significantly lower (22- 
55%), but not as low as low-incidence mouse strains. This latter observation underscores the 
importance of the horizontally-transmitted milk-born virus in the high incidence strains, but also 
indicates the substantial differences in tumorigenic potential among endogenous MMTV 

25 proviruses. Proviruses of various mice with high and low tumor incidence could be 
distinguished by differences in solution hybridization kinetics and restriction endonuclease 
digestion patterns (Cohen et al Cell, 1979; Cohen and Varmus, 1979, 1980; Traina-Dorge and 
Cohen, 1983; Breznik et al 1984). Using restriction enzymes that differentiate between 
hypomethylated and methylated DNA, proviruses of milk-borne MMTV were also shown to be 

30 hypomethylated, whereas most endogenous proviruses contain abundant 5-methylcytosine 



X:iD: <WO 01008 29A2_t_> 



WO 01/00829 



PCT/US00/18279 



-6- 

(Cohen, 1980; Breznik and Cohen, 1982; Breznik et al, 1984). Because hypomethylation is 
associated with increased gene expression, this .observation could explain the importance of 
horizontally-transmitted milk-borne MMTV in mouse strains with a high incidence of BC. 
Specific hypomethylation of an endogenous MMTV provirus was associated with expression of 
5 a 1.6 kb transcript of the lactating mammary gland of the BALB/c mouse (Traina-Dorge et al, 
1985). 

In other studies, endogenous MMTV proviruses have been shown to segregate as stable 
genetic units during inbreeding and that certain of these endogenous proviruses are 
transcriptionally active (Cohen et al, Cell, \979; Cohen and Varmus, 1979, 1980; Traina et al, 

io 1981; Traina-Dorge and Cohen, 1983; Traina-Dorge et al, 1985; Varmus et al, 1978). 
Recombinant inbred (RI) strains of mice have been used to define MMTV provirus composition 
and chromosomal locations (Traina et al, 1981; Traina-Dorge and Cohen, 1983). RI strains^ 
were developed by crossing mice from two highly inbred lines, randomly mating the F2 
generation brothers and sisters and maintaining each as separate lines. Many genetic loci were 

15 mapped to specific chromosomes in these strains. By analyzing the segregation patterns of 
various MMTV proviruses with these genetic markers by restriction endonuclease analysis and 
Southern blotting, it was possible to determine the chromosomal locations of a number of 
MMTV proviruses in these strains and to establish that these provirus segregate by the rule of 
Mendelian inheritance. These and other analyses defined 10 distinct MMTV units (proviruses) 

20 in these animals some full length and some truncated. The ratios of inheritance for most of the 
MMTV proviruses were consistent with simple single-gene inheritance, though three of the 
MMTV units demonstrated some variance. Most noticeable was the presence of a unit 
contributed by one parent that was present in 23 of the 26 RI strains analyzed. Specific proviral 
units were identified that were transcriptionally active and associated with increased tumor 

25 production (Traina-Dorge et al, 1985). Importantly, cellular genes were identified that were 
significantly associated with virus expression showing the involvement of host genetics in 
disease progression (Traina-Dorge et al, 1 985). 

Several lines of evidence indicate that the endogenous MMTV proviruses have integrated 
relatively recently in the germline of various strains of mice (Cohen and Varmus, 1979; Varmus 

30 et al, 1978). If MMTV evolved from elements present in a progenitor of Mus musculus (the 
laboratory mouse), it would be expected that all individual mice would have similar MMTV 
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proviruses. However, both laboratory mice and wild mice trapped at several locations having 
quite variable numbers and distributions of germline integration sites for their MMTV proviruses 
(Cohen and Varmus, 1979). The most striking finding among these results were the presence of 
some wild caught animals that contained no endogenous MMTV proviruses. The most likely 
5 interpretation of these results is that the endogenous MMTV proviruses arose by multiple 
independent integrations into the DNA of germinal cells after speciation of the genus Mus, rather 
than arising from genetic elements present in the evolutionary progenitors of mice. 

SUMMARY OF THE INVENTION 

The present invention provides for recombinant DNA molecules derived from one or 
10 more mammary tumor viruses which are endogenous retroviruses with homology to the 
sequences of MMTV. Specifically, the sequences of the instant invention have at least 99% 
identity with all or a portion of SEQ ID NOs: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29 or at least 
92% identity with SEQ ID NO: 10. 

Additionally, the present invention provides for the RNA molecules produced by 
15 transcription of the DNA described above. Furthermore it provides for the polypeptides resulting 
from the in-frame translation of RNA which has been transcribed from the MTV DNA of the 
instant invention. According to the present invention, the referenced DNA sequences may be 
derived from any suitable source, such sources may include, but are not limited to, human, cat, 
and rhesus macaque. 

20 The present invention also provides for a recombinant DNA plasmid (a vector) which 

comprises mammary tumor virus (MTV) DNA the sequences of SEQ ID NOs: 2, 3, 4, 5, 6, 7, 8, 
10, 21, 23, 25, 27, or 29, or sequences which have at least 99% identity thereto. That is said 
DNA sequence is incorporated in the vector. 

In one embodiment of the present invention the vectors further comprise a heterologous 

25 promoter operably linked to the MTV sequence (i.e. joined in the proper reading frame so as to 
be capable of producing functional MTV RNA and/or protein in vivo and in vitro). In one aspect 
of this embodiment of the invention the vector, which comprises the MTV DNA, is capable of 
episomal replication or chromosomal integration in at least one of the following cell types: 
bacterial cells, yeast cells, insect cells , avian, cells, and mammalian cells (this list of cell types is 

30 representative and should not be considered exhaustive). In another aspect, of this embodiment 
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of the invention, the heterologous promoter provides for the expression of the MTV DNA 

sequence in one or more cell types. Cell types considered useful as part of this aspect of the 

invention include, but are not limited to the following: bacterial cells, yeast cells, insect cells , 

avian, cells, and mammalian cells. 
5 According to another embodiment of the instant invention the MTV DNA sequences 

described above may be used to provide a method of detecting the presence of MTV DNA or 

RNA in a sample (of biological origin, such as serum, or otherwise). 

Another embodiment of the instant invention provides for a method of determining 

whether a sample contains antibodies which recognize proteins derived from the MTV DNA 
10 sequences described above (e.g. polypeptide sequences derived from transcription and translation 

of SEQ ID NOs: : 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29). As a corollary to this aspect, the 

instant invention also provides for antibodies which specifically detect one or more of the., 

polypeptides of the instant invention. 

Another embodiment of the invention provides for diagnostic kits useful for detecting 
is DNA, RNA, or polypeptides, from a mammary tumor virus, in a biological or other type of 

sample. 

Another embodiment of the instant invention provides for methods of attenuating or 
eliminating the activity of MTV in its host animal. Various aspects of this embodiment provide 
for pharmaceutical compositions comprising substances which disrupt the activity of the MTV 
20 reverse transcriptase, protease, or integrase enzymes. Other aspects of this embodiment provide 
for pharmaceutical compositions capable of eliciting an immune response in a host animal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
25 reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

Figure 1 shows the results of amplification of sequences related to MMTV from human 
breast cancer tissue. Panel A: DNA was extracted from human breast tumors (kindly provided 
by Michael Press, M.D., USC, Los Angeles or Derrick Beech M.D., TMC/UT Memphis) and 
30 PCR was performed . using primers specific for the human MMTV env-related gene. PCR 
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products were transferred to nitrocellulose by blotting and MMTV-related products were 
detected by hybridization to a 1.8 kb MMTV env probe, lane a: nonradioactive markers, not 
shown; lanes b-d, f-h, j: breast tumor DNA; lanes e and i: no DNA; lane k: water control; lane 1 
positive control: MMTV env fragment cloned in pBluescript™. Panel B: As a test for the 

s integrity of the DNA from the clinical samples we amplified HERV-3 proviral DNA, a single 
copy human endogenous retrovirus using PCR conditions developed by Griffiths et al (1997). 
Ethidium bromide detection (markers are visible in lane a). Same samples as Panel A except 
lane 1: positive control, HERV3 pol fragment cloned in pBluescript™ (Stratagene). Visible 
bands were present in lanes c and g, but do not copy well. 

]0 Figure2 shows an example of the amplification of sequences related to MMTV from the 

blood of healthy controls. Panel A: DNA was extracted from whole blood of healthy control 
subjects and PCR was performed using primers specific for the human MMTV ewv-related gene 
and PCR products were detected by Southern hybridization, lane a: markers; lanes b-k: DNA 
from whole blood of healthy controls; lane 1: water control; lane m positive control: MMTV env 

is fragment in pBluescript™ (Stratagene). Panel B: PCR amplification of HERV-3. Ethidium 
bromide detection (markers are visible in lane a). Same samples as Panel A except lane m: 
positive control, HERV3 pol fragment in pBluescript™. 

Figure 3 shows an example of the detection of HMTV mRNA by ribonuclease protection 
assay. RNA was extracted from three HMTV PCR positive breast cancer tumors or from yeast 

20 cells and hybridized to either 1 89 base probe specific for HMTV or a 245 base probe for P-actin. 
Samples were then digested with RNAse A/Tl . The fragments of the labeled probes protected by 
hybridized RNA were visualized and analyzed following separation on a denaturing 
polyacrylamide gel. Two of the three tumor samples gave protected fragments of the expected 
size with the HMTV probe (arrow) (lanes 1-3), whereas all three tumor RNAs gave protected 

25 fragments with the 0-actin probe (lanes (5-7). As expected neither probe was specifically 
protected by yeast RNA (lanes 4, 8). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides for isolated DNA, RNA, and polypeptides derived from 
mammary tumor virus (MTV) which is a virus with high sequence homology to mouse 
30 mammary tumor virus. The present invention further provides for diagnostic methods of using 
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these macromolecules. Additionally the present invention provides pharmaceutical reagent 
compositions and diagnostic kits comprising MTV DNA, RNA, and/or polypeptide, which can 
be used according to the disclosed methods. 

Various embodiments of the instant invention provide for MTV nucleic acid 
5 corresponding to at least one of the following: DNA sequences with at least 99% identity with 
one of the following: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 

io nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 

sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 

is least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 

of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 

20 sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 

contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0.. 

Preferably the sequences have greater than 99% identity with greater than 200, 250, 300, 
350 or 400 nucleotides SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29. In another 
25 preferred embodiment the sequences have greater than 93%, 94%, 95%, 96%, 97%, 98% or 99% 
identity with SEQ ID NO: 10. Even more preferably these nucleic acids are derived from human, 
cat, or rhesus macaque MTV's. Most preferably the DNA is identical with all or part of SEQ ID 
NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29. 

In one aspect of this embodiment of the invention the MTV DNA sequences described 
30 above are incorporated in a vector. In various related aspects the invention the MTV sequences 
are under the transcriptional control of an heterologous promoter. The vectors contemplated as 
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being useful according to the instant invention are capable of expressing the MTV DNA 
sequences in at least one of the following cell types: insect cells, bacterial cells, avian cells, 
yeast cells, or mammalian cells. Furthermore the vectors of this aspect of the invention are also 
capable of episomal replication and/or chromosomal integration in at least one of the cell types 
5 listed. 

In another aspect of this embodiment the DNA sequences comprise one or more detection 
moieties. Detection moieties contemplated as being suitable for the instant invention include, 
but are not limited to fluorescent dyes, and radioactive isotopes of phosphorous, sulfur, oxygen, 
carbon, or hydrogen. 

io In yet another aspect of this embodiment of the invention the isolated DNA molecule is 

suspended or dissolved in a diluent compatible with the invention. Suitable diluents do not 
interfere with the use of the DNA according to the various embodiments of the instant invention. 
Diluents useful according to this embodiment of the invention are well known to those skilled in 
the art. They include but are not limited to buffered aqueous solutions. These may be buffered 

15 with any compound compatible with the present invention. Exemplary buffering agents include 
phosphate buffers and buffers comprising trishydroxyaminomethane (Tris). Such buffered 
aqueous solutions may further comprise any other compound, such as sodium chloride, which 
will not interfere with the operation of the instant invention. According to this aspect of the 
instant invention the MTV DNA may be present in any suitable concentration. Preferably the 

20 concentration if from about 0.1 ng/jil to 100 jag/fal. 

Other embodiments of the instant invention provide for RNA transcripts and/or 
polypeptides encoded by the nucleic acids described above. In various aspects of this 
embodiment these RNA transcripts or polypeptides are encoded by any of the DNA sequences 
described above. These RNA transcripts and polypeptides are homologues of RNA transcripts 

25 and polypeptides encoded by the MMTV env, gag, or pol genes. In one aspect of this 
embodiment the RNA transcripts are encoded by the DNA sequences described above. Such 
RNA transcripts have a sequence identical to the disclosed DNA sequence except that the RNA 
contains uridine instead of thymidine. In another preferred aspect of this embodiment the 
purified polypeptide corresponds in sequence with all or part of an env, gag, or pro, protein 

30 product from a human, cat, or rhesus macaque mammary tumor virus. It is preferred that the 
peptides be purified polypeptides. In a preferred aspect of the present invention the purified 
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polypeptides are comprised of all or par, of the sequence resulting f rom the in . ftame 
inscription and translation of any of «he MTV DNA sequences described above In an even 
more preferred aspec, of the presen, invention ,he peptides correspond in sequence to ft. 
product of an in-frame translation of SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23 25 27 or 29 
. Mos, preferably the polypeptide sequences of the in*an, invention correspond ro au'or at ,eas, 
80 amino acids of SEQ ,D NO's ,2, ,3. 14, ,5, ,6, ,7, ,8, 20, 22, 24, 26, or 28. Tbe rem, 
denved from MTV DMA" is meant to convey ^ ^ ^ „ ^ ^ 

o me RNA or peptide produced respectively by the tmnacription or Ascription and in-frame 
translation of MTV DNA. 

» By "purified polypeptides" i, is mean, tha, the majority (greater , han J0 %) of the 

po.ypep.tdes in the sample are the MTV polypeptides. Preferably, the MTV peptides 
confute gma.e, d™ 70% of the polypeptide in the purified polypeptide sample. More* 
preferably the MTV polypeptide constitutes greater dtan 90% of the polypeptide in the sample- < 
Even more preferably the MTV polype^ consIilutes ^ ^ 9J% of ^ ' _ 

» *e samp,e. Additionally, the tern, -purified polypeptide- indicates that the sample does no. 
contatn subs.ances which interfere with me operation of me i„s,an« invention. Purified 
polypepttdes obtained in accordance with the cutren, invention may be from any suitable source 
whtch is compatible with the instant invention. 

Another embodiment of the instan, invention provides for an antibody against an MTV " 
- polypeptide as described above. Antibodies according ,o this aspec, of the invention are 
generated usmg one or more of ,he po.ypeptides of tire instan, invention as me antigenic agen, 
Such a„„bod,es may be either monoclonal or polyclonal in na,ure. Prefcrabiy the antibodies are 
monoclonal. Once the MTV has been provided the antibodies of tins aspec, of «he invention may 
be prepared according to methods well known ,o those skilled in the art. For example 
* polyclonal antibodies are commonly produced by injecting the antigenic agen, (in «he presence' 
of an tmmune response enhancing agen, such as complete Freunds adjuvant) into an animal such 
as a sheep, bovine, equine, goat, or rabbit. 

Monoclonal antibodies according ,o this aspec, of the present invention can be prepared 
by hybrtdoma fusion techniques or by techniques tita, utilize Epstein Barr Vints (EBV) - 
t» _l iza ti 0 „ .echnologies (,o produce human tnAbs), such as are well known by those of ski,, 
■n the an, modified as described herein. In the method of the invention, these techniques involve 
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the injection of an immunogen, in this case, purified MTV polypeptide so as to elicit a desired 
immune response in that animal (i.e., production of antibodies). The experimental animal, (e.g., a 
mouse) is given repeated injections (boosts) of the same immortalized cell line. In a final step, 
the animal is given an injection of primary cells of the chosen cell type. 

5 In the illustrative example herein for the production of agonist monoclonal antibodies to 

megakaryocyte cells, a CMK cell preparation and a CMS cell preparation were used as the first 
immunogens; however, other immortalized megakaryocyte cells, such as Mo7e or DAMI cells, 
could have been used. Other monoclonal antibodies analogous to the agonist antibody of the 
invention BAH-1, which specifically recognizes the C-Mpl receptor, can be generated using 

10 membrane bound c-Mpl receptor protein as the immunogen. To generate agonist monoclonal 
antibodies against other cell types, other cells of hemopoietic lineage are chosen, e.g., stem cells, 
B cells or T cells. In the first immunization step stem cells can be represented, e.g., by the 
immortalized cell line CTS; B cells by the immortalized cell lines ARH-77, SB or Nal-6; and T 
cells by the immortalized cell lines Jurkat or H9. 

15 After a sufficient time, the animal is sacrificed and somatic antibody-producing cells may 

be derived from the lymph nodes, spleens and peripheral blood of primed animals. Spleen cells 
are preferred. Mouse lymphocytes give a higher percentage of stable fusions with the mouse 
myelomas described below. The use of rat, rabbit, frog, sheep and other mammalian somatic 
cells is also possible. The spleen cell chromosomes encoding desired immunoglobulins are 

20 immortalized by fusing the spleen cells with myeloma cells, generally in the presence of a fusing 
agent such as polyethylene glycol (PEG). Any of a number of myeloma cell lines may be used as 
a fusion partner according to standard techniques; for example, the P3-NSl/i-Ag4-l, P3-x63- 
Ag8.653 or Sp2/0-Agl4 myeloma lines. These myeloma lines are available from the American 
Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 201 10-2209. 

25 The resulting cells, which include the desired hybridomas, are then grown in a selective 

medium, such as HAT medium, in which unfused parental myeloma or lymphocyte cells 
eventually die. Only the hybridoma cells survive and can be grown under limiting dilution 
conditions to obtain isolated clones. The supernatants of the hybridomas are screened for the 
presence of antibody of the desired specificity, e.g., by immunoassay techniques such as those 

30 described herein, using the antigen that has been used for immunization. Positive clones can then 
be subcloned under limiting dilution conditions and the monoclonal antibody produced can be 
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isolated. Various conventional methods exist for isolation and purification of the monoclonal 
antibodies so as to free them from other proteins and other contaminants. Commonly used 
methods for purifying monoclonal antibodies include ammonium sulfate precipitation, ion 
exchange chromatography, and affinity chromatography. Hybridomas produced according to 
5 these methods can be propagated in vitro or in vivo (in ascites fluid) using techniques known in 
the art (see, generally, Harlow et al., Antibodies. A Laboratory Manual, Cold Spring Harbor 
Laboratory, pp. 1-726, 1988). 

Other embodiments of the instant invention provide methods for detecting DNA, RNA, 
and/or proteins from MTV in biological and/or other types of samples. 
io One aspect of this embodiment of the invention provides a method for the detection of 

MTV DNA in a sample which comprises the following steps: 

i) obtaining an sample suspected of containing one or more of the MTV DNA* 
sequences described above; 

ii) carrying out a polymerase chain reaction (PCR) to amplify a DNA sequence as 
15 defined in step i); and, 

in) determining the sequence of, or otherwise characterizing, the amplicons (the PCR 
amplified DNA) produced in step ii) to determine whether or not the DNA 
sequence as defined in step i) is present in the sample. 

Another aspect of this embodiment provides a method for the detection of MTV RNA in 
20 a sample which comprises the following steps: 

0 obtaining an sample suspected of containing RNA which encoded by one or more 
of the MTV DNA sequences described above; 

iii) carrying out an RNAse protection assay (RPA); and, 

iii) analyzing the RPA results to determine whether RNA as defined in step i) is 
25 present in the sample and optionally quantitating said RNA. 

It will be recognized by those of ordinary skill that the selection of the parameters 
necessary for optimizing the steps of these methods are well within the abilities of the ordinarily 
skilled artisan. These parameters, for example PCR conditions (e.g. selection of annealing 
temperature, extension times, and primers) and DNA sequencing method, are routinely 
30 determined in labs where such molecular biological techniques are employed. 
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Another aspect of this embodiment of the invention provides for a method for analyzing a 
sample in order to determine whether the sample contains antibodies which recognize MTV 
polypeptides. This method comprises the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified MTV polypeptide; 

iii) performing western immunoblot analysis using the sample of step i) and the 
polypeptide of step ii); and, 

iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sample. 

Another aspect of this embodiment of the invention provides for a method for analyzing a 
cell culture or a tissue sample by one or more immunohistological methods in order to determine 
whether the sample contains MTV proteins. This method comprises the steps of: 

i) obtaining a sample suspected of containing mammary tumor viral proteins; 

ii) preparing the sample of step i) for immunochemical analysis; 

iii) incubating the sample of step ii) with one or a combination of two or more 
monoclonal or polyclonal antibodies specific for MTV polypeptides encoded by 
one or more of the MTV DNA sequences described above, wherein the said 
antibodies optionally have a moiety which allows for their specific detection; 

iv) washing the samples to remove antibody which is not specifically bound; 

v) processing the samples as appropriate for the selected detection method; and, 

vi) analyzing the results of step v.) to determine whether or not MTV proteins are 
present in the sample. 

It is envisioned that the immunochemical analysis may include, but is not limited to 
analysis by western blotting or enzyme-linked immunosorbant assay. It will be recognized that 
the selection of the parameters necessary for optimizing the performance of these methods are 
within the abilities of the ordinarily skilled artisan. Analysis may be performed using either 
direct immunochemistry (if the antibodies have been labeled with a detectable moiety) or by 
indirect immunochemistry. 

Other embodiments of the current invention provide for pharmaceutical compositions 
comprising MTV DNA, RNA and/or proteins described supra. These compositions may further 
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comprise a pharmaceutical^ acceptable excipient, carrier, or diluent and do not contain any 
biologically harmful substances. The pharmaceutical compositions of the present invention may 
be formulated by one having ordinary skill in the art. Suitable pharmaceutical formulations are 
described in Remington Pharmaceutical Sciences which is a standard reference text in the field 
5 which is here in incorporated by reference. 

The pharmaceutical compositions may further comprise coloring or stabilizing agents 
osmotic agents, antibacterial agents, or any other substances as long as such substances do not 
interfere with the function of the composition. The pharmaceutical compositions of the 
invents can, for example, be formulated as a solution, suspension, or emulsion in association 
■ o with a pharmaceutical acceptable parenteral vehicle. Examples of such vehicles are water, 
saline, Ringers solution, dextrose solution, and 5% human albumen. Liposomes may also be 
used. The vehicle may contain additives that maintain isotonicity (e.g.. sodium chloride of 
mannitol) and chemical stability (e.g., buffers and preservatives). It should be appreciated thaf 
endotoxin contamination should be kept at a safe level, for example, less than 0.5 ng/mg protein " 
.5 Moreover, for human administration, preparations should meet sterility, pyrogenicity. general 
safety and purity standards as required by the United States Food and Drug Administration 
Office of Biological Standards. The formulations may be sterilized by commonly used 
techniques such as filtration. 

The phrase "pharmaceutically acceptable" refers to substances and compositions which 
20 do not produce an adverse, allergic, or otherwise untoward reaction when administered to an 
animal, or a human, as appropriate. A substance which caused produced any of these adverse 
effects would be classified as "biologically harmful" within the scope of the present invention 
Pharmaceutically acceptable substances and compositions include, but are not limited to 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
25 delaying agents. Except where incompatible with the invention the use of any conventional 
ingredient is contemplated. Furthermore, supplementary active ingredients which serve some 
other pharmacologically expedient purpose can also be incorporated into the instant 
compositions. 

The methods and compositions described above are contemplated to be of great benefit in 
30 helping to determine whether or not MTV or MTV viral components are present in a person's 
tissues. This knowledge is would be of aid in predicting a patient's susceptibilitv to cancer 
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etiologically derived from MTV. The methods provide a means for determining a patients viral 
load (the amount of virus present in the person's tissues). In this respect it is envisioned that 
tremendous benefit would be achieved by adapting the methods herein described to the 
widespread screening of the general population, and in particular to all mature human females. 
5 Other compositions contemplated as part of various embodiments of the instant invention 

provide a means for stabilizing or reducing the amount of virus present in a human or other 
animal. 

The various methods described above are readily adaptable to preparing diagnostic kits 

for detecting the presence of MTV DNA, RNA, and/or polypeptides. Consequently, for the 
10 clinical practice of the invention, yet other embodiments of the instant invention, which provide 

for diagnostic kits, are contemplated. Such kits are useful for the qualitative and quantitative 

analysis of a sample in order to detect, and perhaps determine the quantity of, MTV DNA, RNA, 

and/or polypeptide present therein. In one aspect of this embodiment of the invention the kit 

includes, as part of its components, one or more recombinant DNA molecules comprising one or 
15 more of the MTV DNA sequences described above. Also according to this aspect of the 

invention the kit may comprise, in addition to (or in alternative to) the recombinant MTV DNA. 

one or more synthetic oligonucleotide primer pairs useful for the PCR amplification of these 

MTV DNA sequences. 

Another aspect of this embodiment of the invention provides for a kit for detecting 
20 antibodies which specifically recognize mammary tumor virus proteins. As part of its 

components this kit includes a reagent comprising one or more polypeptides encoded by the 

MTV DNA sequences described above. 

Also contemplated are kits useful for detecting the presence of MTV proteins by 

immunocytochemistry which comprise one or more antibodies (either monoclonal or polyclonal) 
25 which are specific for at least one MTV polypeptide. Optionally, these antibodies may be 

modified so as to comprise a detection moiety {e.g., a fluorescent die or a radioactive isotope, 

such as 35 S). 

Kits according to these embodiments of the invention may comprise packages, each 
containing one or more of the various reagents (typically in concentrated form) which are 
30 required to perform the respective diagnostic tests. The kits according to these embodiments of 
the invention are contemplated to be useful for detecting and/or quantifying MTV DNA. RNA. 
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and/or protein in biological (or other types of) samples. Such samples may be selected from, but 
are not limited to the following: blood plasma or serum, whole blood, urine, sputum, colonic 
effluent, cerebrospinal fluid, lymphatic fluid, bone marrow, tissue samples (such as from a 
surgical biopsy), or any other sample suspected of containing these biological molecules. 

5 The kits also may further include one or more fiduciary results. As used herein a 

"fiduciary result" refers to a reference standard against which a test outcome is compared to 
gauge the results in terms of quality and/or quantity. A "fiduciary series" is a plurality of such 
references that represent points along a qualitative or a quantitative scale. Preferable in this 
regard, are kits that include a fiduciary series for interpreting results. The fiduciary may be in 

io the form of one or more photographs or may be depicted in other ways, including written 
descriptions. 

Thus fiduciaries may be developed as part of the kits, in accordance with this aspect of 3 
the invention, to guide interpretation of results. In this regard, a RNA or polypeptide^* 
concentration from a biological sample may be characterized in accordance with the foregoing.- 

is Sample may be taken from representative cross-section of patients at various stages of disease 
(including asymptomatic or essentially disease free) to prepare a fiduciary series. 
Characterization in this regard may benefit from hindsight, by following the actual course of 
neoplastic progression in patients as they undergo diagnosis, treatment, and follow up thereafter. 
The determination of viral load (as determined by the abundance of MTV DNA, RNA, or 

20 polypeptide), as set out above, for a variety of patients of known breast cancer status and 
eventual outcome, and the subsequent correlation of these values is of incalculable prognostic 
value. This will help provide the patient with a more accurate prognosis, and it will also aid the 
patient and oncologist in determining the best course of therapeutic treatment, when such 
treatment is necessary. 

25 Another embodiment of the instant invention provides for a composition which induces 

an immunological response against MTV in an animal. A specifically contemplated aspect of 
this invention is a composition which induces an immunological response to human mammary 
tumor viral protein in humans (see Example 7). It is expected that such induced immunity may 
prevent breast cancer in individuals that carry endogenous MTV, by blocking the spread of the 

30 virus to hormonally sensitive tissues. 
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Methods for producing an immune response to viral DNA and/or RNA in animals are 
also known, see for example U.S. patent numbers 5,990,091 and 6,004,799 which are herein 
incorporated by reference. Thus compositions which elicit an immune response to MTV DNA 
and/or RNA are also contemplated as aspect of this embodiment of the invention. 
5 Other embodiments provide for therapeutic compositions which decrease the activity and 

presence of human mammary tumor virus. A common feature of all retroviruses is the presence 
of three enzymes involved in various stages of the viral replication cycle, the reverse 
transcriptase (RT), protease (PR), and integrase (IN). Various pharmaceuticals have been 
developed which target the RT and PR of human immunodeficiency virus (HIV), a retrovirus 
io distantly related to MTV. Furthermore, chemicals which selectively inhibit HIV IN have been 
identified and prototype drugs targeting this enzyme are under development (Hong et ai, 1998; 
Mathe, 1999; Robinson, 1998; Singh et al. 9 2000). According to one aspect of the instant 
invention a pharmaceutical composition comprising one or a mixture of two or more retroviral 
inhibitors in an amount effective to inhibit the activity or the spread of MTV's of the instant 

15 invention is provided. 

Identification of the drug or drugs for use in the pharmaceutical compositions of this 
embodiment can be made using techniques known to those skilled in the art. For examples of 
protease and reverse transcriptase inhibitors and methods of determining the efficacy of such 
drugs as retroviral inhibitors see U.S. Patent Nos. 5,858,738, 6,017,928, and 6,046,228, which 

20 are herein incorporated by reference. The various compositions of this embodiment of the 
invention may comprise, for example, currently known HIV integrase, protease, or reverse 
transcriptase inhibitors; alternatively, such inhibitors can be modified so as to increase their 
specificity for MTV's. Likewise, other classes of HIV inhibitors, such as peptides which are 
analogs of the transmembrane glycoprotein, can serve as pro-drugs to develop specific drugs 

25 which decrease activity and presence of MTV in humans and other species. These compositions 
are contemplated as being useful to inhibit the activity and spread of MTV in humans and other 
species which carry these viruses as endogenous genetic elements. Such drugs could be used to 
treat breast cancer patients afflicted with MTV and would be expected to ameliorate the severity 
and to reduce the recurrence of disease. In addition, such drugs could be used prophylactically to 

30 prevent the occurrence of individuals at high risk for developing the disease. 
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EXAMPLES 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
5 practice of the invention, and thus can be considered to constitute preferred modes for its 
pract.ce. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1: MMTV-related sequences in humans 

Sequences which are highly similar (>95%) to the MMTV env gene were amplified by 
PCR from human DNA samples, including subsets of both BC (breast cancer) tissue and non-BC~ 
tissues. The MMTV-related sequences by were found by PCR and a sensitive blotting technique ' 
not only m breast tumors (see Figure 1), but also in the blood of a subset of healthy controls (see " 
Figure 2), and systemic lupus erythematosus (SLE) patients without breast cancer. Our results 
is differ from those of Wang and coworkers (1995) who, with few exceptions, were able to detect 
MMTV-like sequences only in breast' tumors. The sequences from human DNA were distinct 
from the MMTV sequences used as controls in these PCR reactions indicating that our results are 
not simply due to contamination. A ribonuclease protection assay was used to confirm these 
results using a non-PCR based technique to determine that the majority of the PCR positive BC 
20 tissues, but none of the PCR negative tissues, expressed this sequence at the mRNA level Many 
of the products from these PCR reactions have been sequenced. Analysis of these sequences 
proves further strong evidence that PCR contamination is an unlikely explanation for the 
observed results. MMTV env-Iike sequences from different individuals derived in the same PCR 
run were distinct from each other. This result indicates the lack of an ubiquitous PCR 
25 contaminant that would have produced a more consistent sequence that should have been 
identical (or nearly so ) in the various reaction tubes. Furthermore DNA of individual subjects 
produced internally consistent MMTV env-like sequences from PCR run to PCR run The 
vanations within the MMTV-related sequences from a given patient may represent a low number 
of Taq errors, but are also suggestive of variations expected of a replicating retrovirus (reverse 
30 trascriptase errors). 
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Example 2: RNase Protection Assay determination of MTV RNA levels 

Ribonuclease protection assay (RPA) is a quantitative assay which is frequently used by 
those of ordinary skill in the art to determine the of levels of specific RNA species without PCR 
amplification. Briefly, it is performed as follows for the transcripts of the present invention: a 
5 probe of uniform length is synthesized by in vitro transcription of a cloned template and labeled 
to high specific activity. l35] S-labeled riboprobes were prepared from plasmids containing the 
cloned MTV fragments in the 150 to 400 bp size range. The probe was then hybridized to test 
RNA, and RNA fragments that remain unhybridized are not protected from digestion by RNAse 
A/Tl . The fragments of the labeled riboprobe protected by hybridized RNA were visualized and 

io analyzed following separation on a denaturing polyacrylamide gel. Levels of MTV -related 
mRNA are compared in each sample to levels of mRNA produced by the P-actin gene, a 
"housekeeping" gene to ensure integrity of the RNA sample (i.e., that the RNA has not been 
degraded). RPA detected RNA in 2 out of 3 PCR positive breast tumors (see Figure 3). RPA is 
10-15 times more sensitive than "northern" analysis for detection of rare messenger RNA. RPA 

15 analysis is performed directly on total RNA, without any prior manipulation, which can 
introduce errors in the quantitative analysis. 

Example 3: RT-PCR determination of MTV RNA levels 

MTV mRNAs may also be detected using real time RT-PCR (reverse transcriptase linked 
polymerase chain reaction). For example the I-Cycler (Biorad), with a fluoroscopic detection 
20 facility is capable of determining the real time kinetics of the PCR amplification product by 
quantifying the PCR product in the log-linear phase of the PCR reaction. Using this method, a 
real-time thermocycler can reliably compare miniscule amounts of known nucleotide template in 
a reproducible fashion. 

Example 4: Detection of anti-MTV antibodies in blood by Western blot analysis 

25 The InsectSelect™ System (Invitrogen) which allows stable production of recombinant 

proteins in insect cells in a manner similar to well -characterized baculovirus expression systems 
may be used to generate recombinant MTV proteins. This is a virus-free system which allows 
creation of stable cell lines that continuously produce high-quality protein. Stable cell lines 
generated in about 9 days may be used for continuous, long-term production of recombinant 

30 proteins. The InsectSelect™ expression system is based on a plasmid vector which carries an 
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antibiotic resistance gene for selecting stably expressing insect cell lines. This expression vector 
uses the immediate early promoter, OpIE2, from the Douglas Fir Tussoc moth OpMNPV 
baculovirus for gene expression. OpIE2 is a strong transcriptional promoter in lepidopterin (Sf9, 
5121, High Five™ (Invitrogen)) as well as mosquito and dipterin cell lines. The expression 
5 vector pIZ/V5-His (Invitrogen) has a multiple cloning site and several features that simplify 
production and analysis of recombinant proteins in insect cells. It includes a Zeocin-resistance 
gene that allows for rapid selection of stably transfected cells, a C-terminal tag encoding the V5 
epitope, and a C-terminal polyhistidine (6xHis) sequence. These latter features facilitate rapid 
protein detection with anti-V5 antibodies (Invitrogen) and protein purification with resins that 

10 bind the polyhistidine. Alternative eukaryotic expression systems for MMTV related proteins 
are an in vitro rabbit reticulocyte lysate system (Promega) with in vitro transcribed and capped 
mRNA or the Sindbis virus Expression system (Invitrogen). "± 
MMTV-related genes may also be cloned into a pGEX vector (Pharmacia) and expressed 
in bacteria to make recombinant fusion protein with glutathione S-transferase (GST) protein for 

15 immunoblot studies. This system has several advantages compared to other methodologies of 
protein expression, not the least of which is the ease of production, isolation, and purification of 
the recombinant protein. Fusion proteins generated with the GST protein typically remain 
soluble permitting the recovery from cell lysates. Denaturing conditions are not required during 
purification and therefore the antigenic and enzyme properties of the protein are often 

20 maintained. Moreover, the GST fusion protein may be efficiently and rapidly purified by 
immobilizing the GST on glutathione coated beads or columns and eluted with reduced 
glutathione. 

To perform a western immunoblot, MTV proteins are dissolved in lysis buffer (0.25 M 
Tris Base, pH 6.8, containing 4% sodium dodecyl sulfate, 10% dithiothreitol, 20% glycerol, and 

25 0.01% w/v bromophenol blue), heated to 100°C for 3 min and subjected to sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) on a 10% polyacrylamide gel. These 
proteins may then be transferred electrophoretically to a nitrocellulose membrane (Protran™; 
Schleicher & Schuell) in Tris-glycine (pH 8.3) buffer with 20% methanol. The blots may be 
incubated overnight in blocking buffer (0.02 M Tris, 0.1 M NaCl, heat inactivated goat serum, 

30 0.01% thimerosal, and 5% nonfat dry milk) with serum or plasma from the human or animal 
being tested (1:100 dilution or optimal dilution). Incubation with secondary antibodies, 
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biotinylated anti-human (or other species) IgG goat antibodies diluted 1:500 or 1:1,000 in 
blocking buffer, and with avidin-horseradish peroxidase may be performed at room temperature 
for 2 hours. The immunoblots may be developed with 7.8 mM 4-chloro-l-napthol and 0.03% 
hydrogen peroxide. Bands corresponding to MTV proteins are quantitated by scanning and 
5 processed with image analysis software (NIH Image). 

Example 5: Detection of anti-MTV antibodies by enzyme linked immunoassay 

Antibodies which specifically bind to MTV protein(s) may be detected using an enzyme- 
linked immunoassay with MTV protein or proteins as the target for the antibodies. In this 
technique MTV proteins produced in an expression system, such as the baculovirus/insect cell 

io clutter system or purified from virus preparations may be bound to the bottom of wells in a 
multiwell plastic microtiter plate. Serum or plasma from humans or other species is diluted to an 
empirically determined optimum dilution and incubated from 1 hour to overnight at about 25 °C 
(room temperature). The wells are then washed three times in a saline solution containing 
Tween® 20 (Aldrich) orNP-40 (currently available as Igepal™ CA-630, Sigma) detergents using 

is an automated plate washer. Antibodies bound to the MTV proteins may then be detected by 
reacting the wells sequentially . with buffers containing biotinylated goat anti-human 
immunoglobulin, followed by avidin coupled to horseradish peroxidase, and finally 3,3', 5,5'- 
tetramethylbenzidine a substrate for horseradish peroxidase which produces a colored reaction 
product. Between each step the well is typically washed three times in a saline solution 

20 containing Tween® 20 or NP-40 detergents using an automated plate washer. The colored 
reaction produced in each well is quantitated using a spectrophotometer plate reader. The 
amount of colored reaction product is proportional to the amount of antibodies to MTV present 
in the original sample. 

Example 6: Detection of MTV retroviral proteins in situ using immunohistochemistry 

25 MTV retroviral proteins may be detected in tissue samples by immunohistochemical 

assays. In this technique MTV proteins produced in an expression system, such as the 
baculovirus/insect cell clutter system or purified from virus preparations may be used to generate 
monoclonal or polyclonal antibodies by methods well known to those skilled in the art. These 
antibodies may then be labeled with a detection moiety such as biotin, fluorescein, horseradish 

30 peroxidase or other labels used for this purpose. The labeled antibody preparations are then 
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diluted to an empirically determined optimum dilution and incubated with fixed or frozen thin 
sections or cell preparations from samples to be tested for the presence of MTV proteins. The 
samples are processed, as dictated by the type of label used, and the binding of the antibody to 
MTV retroviral proteins may be detected by microscopy, autoradiography, flow cytometry, etc., 
5 again, as dictated by the identity of the labeling moiety. 

Example 7: Preparation of a pharmaceutical composition capable of eliciting an 

immunological response, against human mammary tumor virus, in humans. 

A pharmaceutical composition to be used to induce an immunological response an in 

humans or animals to MTV protein may be developed as follows. MTV proteins produced in an 

10 expression system, such as the baculovirus/insect cell culture system described above or purified 

from virus preparations may be extensively purified using column chromatography and/or any 

other methodologies commonly employed by skilled artisans. The purified MTV proteins may; 

then be mixed or emulsified with a suitable adjuvant preparation (currently alum is the only^ 

adjuvant approved for use in humans) to produce a vaccine. This immunogenic composition 

15 may then be injected subcutaneously or intradermally into the target animal. 

All of the composition and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 

20 compositions and methods and in the steps or in the sequence of steps of the method described 
herein without departing from the concept, spirit and scope of the invention. More specifically, 
it will be apparent that certain agents which are both chemically and physiologically related may 
be substituted for the agents described herein while the same or similar results would be 
achieved. All such similar substitutes and modifications apparent to those skilled in the art are 

25 deemed to be within the spirit, scope and concept of the invention as defined by the appended 
claims. 
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CLAIMS: 

1 . An isolated DNA molecule comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3 5 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7 5 at 
least 2 1 0 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10. 

2. The isolated DNA molecule of claim 1 which comprises SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 
10, 21, 23, 25, 27, or 29. 

3. The isolated DNA molecule of claim 1 wherein the DNA sequence is derived from 
human, rhesus macaque, or cat. 

4. The isolated DNA molecule of claim 1 s further comprising a detection moiety. 

5. The isolated DNA molecule of claim 1, which is suspended or dissolved in a diluent. 

6. The isolated DNA molecule of claim 5 wherein the diluent is a buffered aqueous 
solution. 
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aic017.The isolated DNA molecule of claim 5 wherein the DNA is present at a concentration of 
from about 0.1 ng/|il to 100 ng/fal. 

8. A purified polypeptide encoded by a DNA molecule comprising a DNA sequence that 
has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0. 

9. The purified polypeptide of claim 8 which is SEQ ID NO: 12, 13, 14, 15, 16, 17, 18, 20, 
22, 24, 26, 28, or 30. 

ac03 1 0. The purified polypeptide of claim 8 which is present in a pharmaceutical 

composition. 

11. An antibody which recognizes a polypeptide encoded by a DNA molecule comprising a 
DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
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nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID N05 at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO? at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 2 0 '-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotide, 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0; 
wherein said antibody was generated using said polypeptide. 
The antibody of claim 1 1 which is a monoclonal antibody. 

An RNA corresponding to a DNA molecule a DNA molecule comprising a DNA 
sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID N0 5 at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID N07 at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 1 00 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
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sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID N0.29; or 

c) at least 92% identity with SEQ ID NO: 1 0. 

14. The isolated DNA molecule of claim 1 which is incorporated in a vector; 

5 wherein said DNA sequence is under the transcriptional control of a heterologous 

promoter. 

15. The vector of claim 14 which is capable of expressing the DNA sequence in at least one 
of the following cell types: insect cells, bacterial cells, avian cells, yeast cells, or 
mammalian cells. 

io 16. The vector of claim 14 wherein said vector is capable of episomal replication or 

chromosomal integration in at least one of the following cell types: insect cells, bacterial 
cells, avian cells, yeast cells, or mammalian cells. 

1 7. A method for the detecting the presence of a mammary tumor viral DNA comprising the 
steps of: 

15 0 obtaining a biological sample suspected of containing DNA which comprises a 

sequence molecule a DNA molecule comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 

20 represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 

nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 

25 SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 

nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 170 
contiguous nucleotides of SEQ ID NO:23, at least 1 00 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
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contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO:10; 

ii) carrying out a polymerase chain reaction to amplify the suspected DNA sequence 
as defined in step i); and, 

iii) determining the sequence of, or otherwise characterizing, the amplicons produced 
in step ii) to determine whether or not the DNA sequence as defined in step i) is 
present in the sample. 

1 8. The method of claim 1 7 wherein the biological sample is obtained from human, rhesus 
macaque, or cat. 19. A method of detecting the presence of antibodies which recognize 
one or more mammary tumor virus polypeptides, the method comprising the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified polypeptide encoded by a DNA molecule 
comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8. 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 
contiguous bases of SEQ ID NO:2, at least 1 50 contiguous nucleotides of the 
sequence represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 
contiguous nucleotides of the sequence represented by nucleotides 1-400 of 
SEQ ID NO:4, at least 130 contiguous nucleotides of the sequence represented 
by nucleotides 1-360 of SEQ ID NO: 5, at least 130 contiguous nucleotides of 
SEQ ID NO:6, at least 140 contiguous nucleotides of the sequence represented 
by nucleotides 1-380 of SEQ ID NO:7, at least 210 contiguous nucleotides of 
the sequence represented by nucleotides 20-462 of SEQ ID NO:8, at least 160 
contiguous nucleotides of the sequence represented by nucleotides 97-462 of 
SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID NO:23, at least 
100 contiguous nucleotides of the sequence represented by nucleotides 337- 
462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the sequence 
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represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0; 

iii) performing an immunochemical analysis using the sample of step i) and the 
5 polypeptide of step ii); and, 

iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sample. 

20. The method of claim 1 9 wherein the immunochemical analysis of step iii) is a Western 
blot analysis. 

io 21 . The method of claim 19 wherein the immunochemical analysis of step iii) is 
enzyme-linked immunosorbant assay (ELIZA) analysis. 

22. The method of claim 19 wherein one or more of the polypeptides are derived from the 
en\\ or gag, gene and the antibodies are specific for one or more of the following 
polypeptides SEQ ID NO: 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, 28, or 30. 

The method of claim 19 wherein the sample of step ii) comprises antibodies derived from 
a human, rhesus macaque, or cat. 

24. A diagnostic kit for detecting DNA or RNA from a mammary tumor virus, said kit 
comprising a reagent of a DNA molecule that comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 

25 le *st 130 contiguous nucleotides of SEQ ID NO:6, at least 1 40 contiguous 

nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 1 70 contiguous nucleotides of SEQ ID 

30 NO:23, at least 100 contiguous nucleotides of the sequence represented by 
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nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0. 

5 25. A diagnostic kit for detecting antibodies to mammary tumor virus, where said kit 
comprises a reagent comprises one or both of the following: 
i) one or more polypeptides encoded by a DNA molecule comprising a DNA 
sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8 ? 21 , 25, or 27; or 
io b) at least 99% identity with one or more of the following: at least 340 contiguous 

bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
15 of SEQ ID NO:5, at least 1 30 contiguous nucleotides of SEQ ID NO:6, at least 

140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 170 
20 contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 

the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 10; or 
25 ii) an antibody specific for a polypeptide according to step i). 

26. A method for the detection of MTV RNA in a sample which comprises the following 
steps: 

i) obtaining an sample suspected of containing RNA which encoded by one or more 
DNA sequences that have: 
30 a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25 ? or 27; or 
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b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 1 70 
contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 *r 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ ~ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or t 
c) at least 92% identity with SEQ ID NO: 10;; 
iv) carrying out an RNAse protection assay (RPA); and, 

iii) analyzing the RPA results to determine whether RNA as defined in step i) is . 
present in the sample and optionally quantitating said RNA. 
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SEQUENCE LISTING 

<110> Garry, Robert F. 

5 <120> Human Endogenous Retrovirus in Breast Cancer 

<130> TUMC:012 

<140> 
10 <141> 



35 



20 



25 



30 



<160> 30 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 461 
<212> DNA 

<213> Mouse mammary tumor virus 
<400> 1 

tcccttccct cgcctagtgt agatcagtca" 
ggaaattata ctccccctgt caataaagag 
gaacctacat ggttctggga aaattctcct 
ctagttcccc atacagaatt gtttcgctta 
aggccaggat ttcaagaaca tgacatgatt 
gccatattat taggattacc tcagctaata 
atttcctgtt cttcttgtag attgactaat 
gcgatcatag tcaagaggcc gccatacgtg 



gatcagatta 
gttcatcgat 
aaggatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagatatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 12 0 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



<210> 2 
<211> 461 
<212> DNA 
35 <213> Human Mammary Tumor Virus 



40 



45 



<400> 2 

tcccttccct 

ggaaattata 

gaacctacat 

ctagttcccc 

aggccaggat 

gccatattat 

atttcctgtt 

gcgatcatag 



cgcctagtgt 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaaggc 



agatctgtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgacatgatt 
tcagctaata 
attgactaat 
gccatatgtg 



gatcagatta 
gttcatcgat 
aaggatccca 
gttgcagcct 
cctacatctg 
gatatagaaa 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagatatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta . 120 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



<210> 3 

<211> 461 

50 <212> DNA 

<213> Human Mammary Tumor Virus 

<400> 3 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaaacaaaaa ggatctattt 60 
55 ggaaattata ctccccctgt caataaagag- gttcatcgat ggtatgaagc aggatgggta 12 0 
gaacctactt ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 240 
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aagccaggac ttcaagaaca tgagatgatt cctacatctg cctgtgttac ttacccttat 

gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 

atttcctgtt cttcttgttg attgactaat tgtttagatt cttctgccta cgactatgca 

gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 

5 

<210> 4 
<211> 461 
<212> DNA 
10 <213> Human Mammary Tumor Virus 

<400> 4 

tcccttccct cgcctagtat agaacagtca aatcagatta aaagcaaaaa ggatctactt 60 

ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 12 0 

15 gaacctactt ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 

ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 240 

aggccaggat ttcaagaaca tgagatgatt cctacatctg cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaga agagaggatc tacttttcat 360 

atttcctgtt cttcttgtag attgactaat tgtttagact cttctgccta cgactatgca 420 

20 gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



300 
360 
420 
461 



<210> 5 

<211> 461 

25 <212> DNA 

<213> Rhesus Mammary Tumor Virus 



<400> 5 
tcccttccct 

30 ggaaattata 
gaacctactt 
ctagttcccc 
aagccaggat 
gccatattat 

35 atttcctgtt 
gcgatcatag 



cgcctagtgt 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaggcc 



agatcagtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgagatgatt 
tcagctaata 
attgactaat 
gccatacgtg 



aatcagatta 
gttcatcgat 
aaagatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagacatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 12 0 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



<210> 6 
40 <211> 461 
<212> DNA 

<213> Rhesus Mammary Tumor Virus 
<400> 6 

45 tcccttccct cgcctagtgt agatcagtca 

ggaaattata ctccccctgt caataaagag 

gaacctactt ggttctggga aaattctcct 

ctagttcccc atacagaatt gtttcgctta 

aagccaggat ttcaagaaca tgagatgatt 

50 gccatattat taggattacc tcagctaata 

atttcctgtt cttcttgtag accgactaat 

gcgatcatag tcaagaggcc gccatacgtg 



aatcagatta aaagcaaaaa ggatctattt 60 

gttcatcgat ggtatgaagc aggatgggta 12 0 

aaagatccca atgatagaga ttttactgct 180 

gttgcagcct caagacatct tattctcaaa 240 

cctacatctg cctgtgttac ttacccttat 300 

gatatagaga aaagaggatc tacttttcat 36 0 

tgtttagatt cttctgccta cgactatgca 420 

ctgctacctg t 461 



55 <210> 7 

<211> 461 
<212> DNA 
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<213> Cat Mammary Tumor Virus 
<400> 7 



10 



33 SSta Sc«S 3 t SSK" »««.ctt ,0 

9a .cc„c,t S9 ttc C33 s :s"s? s 33 ss; ssss :??r s39 " 120 

ctagttcccc atacagaatt ottteoetf. J+?H atgatagaga ttttattgct 180 

aagccaggat ttcaagaaca SIS«t? I'? Caa 9 acatct tattctcaaa 240 

gccatattat taggaltacc =ctacatctg cctgtgttac ttacccttat 3 00 

atttcctgtt ctfcttgtag at^acta^ Pr"* 9 * 9 * aaa ^9"c tacttttcat 360 
gcg.tc.Lg tcaagag^c gccl^g 2£2£ "° 

4 61 



<210> 8 
15 <211> 461 
<2I2> DNA 

<213> Cat Mammary Tumor Virus 



<400> 8 



20 



25 



iztitiitt sss ssfs rr u a " 9 ""** *«<«<« .» 

9 ..==tac.c 3gtt = C33 3 , S""S? .: 33 "S, I?SSS ;??r 999 " 120 

ctagttcccc atacagaatt ottteoetta n+rZZ ! atgatagaga ttttactgct 180 
aagccaggat ttcaagaaca Sg^gatc SJSfeS c^T" ta " Ctcaaa 
gccatattat taagattacc tcaaet«»*. c " acatct 9 cctgtgttac ttacccttat 300 
atttcctgtt cttcttgtag att^ctaat ^ S aaa ^9"c tacttttcat 360 
gcgatcatag tcaagaggcc gcca^cgcg ^gct^cctg C9.ct.tgc. 420 

4 63. 



30 <210> 9 

<211> 104 
<212> DNA 
- <213> Mouse mammary tumor virus 

35 <400> 9 



suss ssss ss=s :sss ssr— ~* « 



40 <210> 10 
<2ll> 104 
<212> DNA 

<213> Cat Mammary Tumor Virus 



4 Lisls ssss tssss :sss sr- — « 



50 <210> 11 
<211> 153 
<212> PRT 

<213> Mouse mammary tumor virus 
55 <400> li 



Ser L eu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin He Lys Ser Lys 



io 15 
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Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

5 Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

10 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu lie Leu Lys 
65 70 75 80 

Arg Pro Gly Phe Gin Glu His Asp Met lie Pro Thr Ser Ala Cys Val 
15 85 90 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
100 105 110 

20 Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

25 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 



30 <210> 12 
<211> 153 
<212> PRT 

<213> Human Mammary Tumor Virus 
35 <400> 12 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 
15 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
40 2 0 2 5 3 0 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

45 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 
65 70 75 80 

50 

Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 
85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
55 100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
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125 

Thr Asn cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie He Val 
- 135 140 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 
14 * 150 

10 <210> 13 
<211> 152 
<212> PRT 

<213> Human Mammary Tumor Virus 
15. <400> 13 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin He Lys Asn Lys 



10 



15 



^ Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val 



25 



His 



30 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu 
35 4 ° 45 



Asn 



25 Ser Pro Lys Asp Pro Asn Asp Arg 



50 



55 



Asp Phe Thr Ala Leu Val Pro His 



60 



Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 

7 0 nr 

30 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 

85 90 95 

^ Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 



110 



Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Leu Thr 

120 12 5 

40 Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val Lys 

135 140 

Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

45 

<210> 14 

<211> 153 

<212> PRT 

50 <213> Human Mammary Tumor Virus 



<400> 14 

Ser Leu Pro Ser Pro 



Ser He Glu Gin Ser Asn Gin Il € 



2 ~~ ^ c ^ u ^-Ln ser Asn Gin He Lys Ser Lys 



10 1S 



Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 
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Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

5 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu lie Leu Lys 
65 70 75 80 

10 

Arg Pro Gly Phe Gin Glu His Glu Met lie Pro Thr Ser Ala Cys Val 
8 5 90 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
15 100 105 110 

Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

20 Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val 
130 135 140 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

25 

<210> 15 

<211> 153 

<212> PRT 

30 <213> Rhesus Mammary Tumor Virus 

<400> 15 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asn Gin lie Lys Ser Lys 
15 10 15 

35 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
40 3 5 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

45 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu lie Leu Lys 
65 70 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met lie Pro Thr Ser Ala Cys Val 
85 90 95 

50 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
55 115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val 
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130 135 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 



140 



5 



<210> 16 
<211> 153 
<212> PRT 

JO <213> Rhesus Mammary Tumor Virus 
<400> 16 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asn Gin lie Lys Ser Lys 
,5 5 10 " 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 " 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
1K) 35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

25 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 



Lys Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 

85 90 SB 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Pro 
H5 12 0 



125 



Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

40 Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

<210> 17 
45 <211> 153 
<212> PRT 

<213> Cat Mammary Tumor Virus 
<400> 17 

50 Ser Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin He Lys Ser Lys 

5 10 15 



55 



Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 2S 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

40 45 
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Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

55 60 

5 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 



75 



80 



Lys Pro Gly Phe Gin Glu His Lys Met He Pro Thr Ser Ala Cys Val 

85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Arg Leu Pro Gin Leu He Asp He 
100 1Q 5 110 

^ Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 



130 135 

20 Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 15 0 



<210> 18 

25 <211> 153 

<212> PRT 

<213> Cat Mammary Tumor Virus 



140 



<400> 18 

30 Ser Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin He Lys Ser Lys 

15 



10 



Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
J5 40 45 

40 P so ^ ** 5 ASP Phe Thr Ala ^u Val Pro His 

55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 

70 ? 5 80 

45 Lys Pro Gly Phe Gin Glu His Lys Met lie Pro Thr Ser Ala Cys Val 

85 9° 95 

Thr Tyr Pro Tyr Ala He Leu Leu Arg Leu Pro Gin Leu He Asp He 
50 1 05 HO 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 

120 125 

^ ^ So ^ Ala ^ r ASP ^ Ala Ala "« 1^ Val 

135 140 
Lys Arg Pro Pro Tyr Val Leu Leu Pro 
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9- 



145 150 



<210> 19 
5 <211> 35 
<212> PRT 

<213> Mouse mammary tumor virus 
<400> 19 

10 Met Met Pro Arg Gly Glu Gly Ser Asp lie Leu He Lys Gin Leu Ala 
1 5 10 15 

Trp Glu Lys Cys Lys Phe He Val Ser Gly Ser His Pro Pro Asn Thr 
20 25 30 

15 

Lys Asn Arg 
35 



20 <210> 20 
<211> 35 
<212> PRT 

<213> Cat Mammary Tumor Virus 
25 <400> 20 

Met Met Pro Arg Gly Glu Gly Ser Asp He Leu He Lys Gin Leu Ala 
1 5 10 15 

Tyr Lys Lys Cys Lys Phe He Val Pro Arg Ser Tyr Pro Ser Asn Thr 
30 20 25 30 

Lys Asn Arg 
35 



35 



40 



50 



<210> 21 

<211> 460 

<212> DNA 

<213> Human Mammary Tumor Virus 



<400> 21 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaag gatctatttg 60 
gaaattatac tccccctgtc aataaagagg ttcatcggtg gtatgaagca ggatgggtag 12 0 
aacctacatg gttctgggaa aattctccta aggatcccaa tgatagagat tttactgctc 180 
45 tagttcccca tacagaattg tttcgcttag ttgcagcctc aagatatctt attctcaaaa 240 
ggccaggatt tcaaggacat gacatgattc ctacatctgc ctgtgttact tacccttatg 300 
ccatattatt aggattacct cagctaatag atatagaaaa aagaggatct acttttcata 360 
tttcctgttc ttcttgtaaa ttgactaatt gtttagattc ttctgcctac gactatgcag 420 
cgatcatagt caagaaggcg ccatatgtgc tgctacctgt ~ 460 



<210> 22 

<211> 38 

<212> PRT 

<213> Human Mammary Tumor Virus 

<400> 22 
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- 10 



10 



15 



35 



50 



Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin lie Lys Ser Lys 
1 5 10 15 

Arg lie Tyr Leu Glu lie lie Leu Pro Leu Ser lie Lys Phe lie Gly 
20 25 30 

Gly Met Lys Gin Asp Gly 
35 



<210> 23 
<211> 461 
<212> DNA 

<213> Human Mammary Tumor Virus 



<400> 23 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaaa ggatctattt 60 
ggaaattata ctccccctgt caataaaggg gttcatcgat ggtatgaagc aggatgggta 12 0 
gagcctacat ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 
20 ctagttcccc atacagaatt gtttcgctta gttgcagcct caagatatct tattctcaaa 240 
aggccaggat ttcaaggaca tgacatgatt cctacatctg cctgtgttac ttacccttat 3 00 
gccatattat taggattacc tcagctaata gatatagaaa aaagaggatc tacttttcat 360 
atttcctgtt cttcttgtaa attgactaat ttgtttagat cttctgccta cgaatatgca 420 
gcgatcatag tcaagaaggc gccatatgtg ctgctacctg t 461 

25 

<210> 24 
<211> 153 
<212> PRT 
30 <213> Human Mammary Tumor Virus 

<400> 24 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 
15 10 15 



Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
40 35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

45 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu lie Leu Lys 
65 70 75 80 

Arg Pro Gly Phe Gin Glu His Asp Met lie Pro Thr Ser Ala Cys Val 
85 90 . 95 



Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 



Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
55 115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
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5 



11 - 



130 135 140 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 



<210> 25 
<211> 461 
<212> DNA 
10 <213> Human Mammary Tumor Virus 

<400> 25 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaaa ggatctattt 60 

ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 120 

15 gaacctacat ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 

ctagttcccc atacagaatt gtttcgctta gttgcagcct caagatatct tattctcaaa 240 

aggccaggat ttcaagaaca tgacatgatt cctacatctg cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaaa aaagaggatc tacttttcat 360 

atttcctgtt cttcttgtag attgactaat tgtttagatt cttctgccta cgactatgca 420 

20 gcgatcatag tcaagaaggc gccatatgtg ctgctacctg t 461 

<210> 26 
<211> 153 
25 <212> PRT 

<213> Human Mammary Tumor Virus 

<400> 26 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 
30 1 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Gly Val His 
20 25 30 

35 Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

40 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 
65 70 75 . 80 

Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 
45 85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 HO 

50 Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala ' Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

55 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 
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<210> 27 
<211> 460 
5 <212> DNA 

<213> Human Mammary Tumor Virus 

<400> 27 

tcccttccct cgcctagtgt agatcagtca 
10 ggaaattata ctccccctgt caataaaggg 
gagcctacat ggttctggga aaattctcct 
ctagttcccc atacagaatt gtttcgctta 
aggccaggat ttcaagaaca tgacatgatt 
gccatattat taggattacc tcagctaata 
15 atttcctgtt cttcttgtag attgactaat 
gcgatcatag tcaagaaggg ccatatgtgc 



gatcagatta aaagcaaaaa ggatctattt 60 
gttcatcgat ggtatgaagc aggatgggta 120 
aaggatccca atgatagaga ttttactgct 180 
gttgcagcct caagatatct tattctcaaa 240 
cctacatctg cctgtgttac ttacccttat 300 
gatatagaaa aaagaagatc tacttttcat 360 
ttgtttagat cttctgccta cgaatatgca 420 
tgctacctgt 460 



<210> 28 
20 <211> 152 
<212> PRT 

<213> Human Mammary Tumor Virus 
<400> 28 

25 Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 
1 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro Kis 
35 50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu lie Leu Lys 
65 70 75 80 

40 Arg Pro Gly Phe Gin Glu His Asp Met lie Pro Thr Ser Ala Cys Val 

85 90 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
100 105 110 

45 

Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Glu Tyr Ala Ala He He Val 
50 1 30 135 1 40 

Lys Lys Gly His Met Cys Cys Leu 
145 150 

55 

<210> 29 
<211> 461 
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<212> DNA 

<213> Human Mammary Tumor Virus 
<400> 29 

5 tcccttccct cgcctagtgt agatcagtca gatcagatta aaaacaaaaa ggatctattt 60 

ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 12 0 

gaacctactt gattctggga aaattctcct aaagatccca atgatagaga ttttactgct 180 

ctagttcccc atacagaatt gttccgctta gttgcagcct caagacatct tattctcaaa 240 

aagccaggat ttcaagaaga tgacatgatt cctacatctg cctgtgttac ttacccttat 300 

10 gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 360 

atttcctgtt cttcttgtag attgactaat tgtttagatt cttctgccta cgactatgca 420 

gcgatcatag tcaagaaggc gccatacgtg ctgctacctg t 461 

15 <210> 30 
<211> 43 
<212> PRT 

<213> Human Mammary Tumor Virus 
20 <400> 30 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin lie Lys Asn Lys 
15 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
25 20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr 
35 40 

30 
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(57) Abstract: The present invention is 
related to mammary tumor virus (MTV). 
MTV represents a group of retroviruses which 
possess very high homology to mouse mammary 
tumor virus (MMTV), a virus known to cause 
neoplastic mammary disease in mice. As 
described herein, MTV*s have been identified in 
human, cat, and Rhesus macaque. The present 
invention specifically provides for recombinant 
nucleic acids and polypeptides derived from 
these MTV's as well as methods for using these 
biological molecules. 
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(57) Abstract: The present invention is related to 
mammary tumor virus (MTV). MTV represents 
a group of retroviruses which possess very 
high homology to mouse mammary tumor virus 
(MMTV). a virus known to cause neoplastic 
mammary disease in mice. As described herein. 
MTV's have been identified in human, cat, and 
Rhesus macaque. The present invention specifically 
provides for recombinant nucleic acids and 
polypeptides derived from these MTV's as well as 
methods tor using these biological molecules. 
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(57) Abstract: The present invention is 
related to mammary tumor virus (MTV). 
MTV represents a group of retroviruses 
which possess very high homology to mouse 
mammary tumor virus (MMTV), a virus 
known to cause neoplastic mammary disease in 
mice. As described herein, MTV's have been 
identified in human, cat, and Rhesus macaque. 
The present invention specifically provides for 
recombinant nucleic acids and polypeptides 
derived from these MTV's as well as methods 
for using these biological molecules. 
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HUMAN ENDOGENOUS RETROVIRUS IN BREAST CANCER 



BACKGROUND OF THE INVENTION 

The present application takes priority from Provisional U.S. patent application number 
60/141,626 filed on June 30, 1999 the entire text of which is specifically incorporated herein by 
s reference. 

1. Field of the Invention 

The present invention relates to compositions and methods for the detection, prognostic 
evaluation, and treatment of oncogenic disorders, particularly breast cancer. 

Specifically, the instant invention provides for compositions, useful for identifying and 
.o treating disorders related to newly identified endogenous retroviruses which are present in a 
subset of humans, cats, and nonhuman primates. 

2. Technical Problem Addressed by the Invention 

Mutations in known susceptibility genes do not account for all breast cancer 

Breast cancer (BC) is one of the leading causes of cancer death among women. The 
is induction of BC is thought to involve the interplay of several factors, including the genetic, 
hormonal, immunological and physiological status of the host, as well as dietary habits and 
exposures to chemicals, radiation or infectious agents. It is now clear that variations of several 
genes, including BRCA-1 and BRCA-2, can result in greatly increased risks for development of 
BC. However, defects in known BC susceptibility genes account for only about 5% of BC, the 
20 so-called familial cases (Armstrong et al. , 2000; Gayther et al. , 1 998). 

As with other types of cancer, the possibility that a virus is etiologically involved in 
sporadically occurring BC has not been eliminated. Consequently, there has long been a need to 
determine which, if any viruses are causally linked to the development of BC. The identification 
of such a virus would likely provide invaluable aid in the following areas BC medicine: 
25 prevention, diagnosis, determination of prognosis; and treatment. 
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3. Description of Related Art 
Retrovirus induction of breast cancer in mice 

Mouse mammary tumor virus (MMTV), a B-type retrovirus, was discovered during 
studies of hereditary cancer in mice at the Jackson Laboratories in the 1930's (Bittner, 1936). As 
the prototype of slow-transforming retroviruses, MMTV has been definitively shown to cause 
BC in mice. Prior studies established that MMTV is transmitted both in the germline as 
endogenous proviruses and exogenously via milk. As endogenous elements MMTV proviruses 
follow patterns of Mendelian inheritance, as other sequences in the genome (Cohen et al, 
Ce//,1979; Cohen and Varmus, 1979, 1980; Traina et al., 1981; Traina-Dorge and Cohen, 1983; 
Traina-Dorge et al, 1985; Varmus et al, 1978). Horizontal transmission of MMTV typically 
occurs by infection of mouse pups by MMTV virions present in the milk of infected dams. 
Thus, it is possible to transmit MMTV to mice by foster feeding. 30 or more unique provlral 
integration sites for endogenous MMTV have been identified. However, some wild mice do not 
carry any endogenous MMTV proviruses (Cohen and Varmus, 1979; Cohen et al., 1982). This 
result suggests that the many endogenous MMTV proviruses are relatively recent additions to the 
mouse genome. The most likely explanation is that MMTV entered the germline of certain mice 
(but not others) on multiple occasions after the evolutionary splits among the various species and 
subspecies of the genus Mus. Certain endogenous MMTV can be activated by hormones to form 
infectious virions capable of inducing mammary carcinomas after long latency periods. Most 
endogenous MMTV proviruses are defective and do not encode for infectious virions. 

Roles of MMTV genes and cellular genes in oncogenesis 

The MMTV Orf protein can function as a superantigen (SA). When expressed in the 
thymus during fetal/early development it can mediate complete or incomplete deletion of SA- 
reactive T-cells. SA expression is required to activate B-cells targets of MMTV in the 
gut-associated lymphoid tissue of nursing pups. Complete deletion of the SA responsive clones 
thus renders the mice resistant to MMTV infection in the gut and thereby leads to a low 
incidence of MMTV-induced tumors. On the other hand, in mice with only partially deleted 
responsive clones of lymphoid cells the SA activation stimulates expansion of the targets and 
spread of MMTV. As female infected animals reach puberty, estrogenic hormones drive 
expression of the MMTV long terminal repeat (LTR) through its hormone response element 
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(HRE). This permits production and assembly of MMTV and spread of the virus to other 
hormonally-sensitive tissues, including the breast and ovaries. Integration of MMTV LTRs 
adjacent to certain cellular genes, such as the proto-oncogenes Jnt, Wnt and Fgf, can increase 
expression of these genes resulting in BC and other cancers. 

5 The molecular genetic interactions between MMTV, the immune system of its murine 

host, and the breast and other hormonally-sensitive cells malignantly transformed by this 
retrovirus have been extensively studied. MMTV promotes mammary gland cancer in mice by 
insertional mutagenesis (Varmus et aL, 1978; Varmus, 1985). MMTV proviral LTR elements 
direct steroid hormone-dependent transactivation of various cellular oncogenes including Wnt, 

10 Fgf and Int thereby promoting clonal expansion of tumor cells (Shackleford and Varmus, 1987, 
Shackleford et al, 1993; Jakobovits et aL, 1986; Nusse, 1991; Nusse et a/., 1985). For 
productive, persistent infection and completion of its replication cycle, MMTV must contain a 
superantigen and interact with a functional host immune system (Golovkina et al y 1995; Luther 
and Acha-Orbea, 1996; Coffin, 1992). 

is The search for a human breast cancer virus 

The discovery of the oncogenic MMTV has prompted many investigators to explore a 
retroviral etiology for BC in humans (Sarkar, 1980). Data collected over the past five decades 
has suggested the existence of a human homologue of MMTV. In 1971, Moore and associates 
reported that 60% of human milk samples from BC patients contain B-type particles 

20 indistinguishable from MMTV by electron microscopy, compared to 5% of the general 
population (Moore et aL, 1971). These investigators also reported that 39% of Parsi women of 
India, an inbred population with a two-fold increased incidence of BC, had B-type particles in 
their milk (Das et aL, 1972; Moore, 1971). Several studies have demonstrated that BC cells, but 
not cells from normal tissues, also contain reverse transcriptase (RT), an enzyme associated with 

25 all retroviruses. Numerous investigators have examined serum and breast milk for the presence 
of antibodies reactive with MMTV. Most of these studies were performed in the pre-AIDS era, 
prior to the advent of highly sensitive and specific techniques for detecting anti-retroviral 
antibodies made necessary for detection of HIV antibodies in donated blood. 

Despite the numerous electron microscopic, biochemical and immunological studies on 

30 human breast carcinoma tissue, milk, patients 1 sera, and breast carcinoma cell lines suggesting 
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the existence of a human homologue of MMTV, proof that such an agent exists has remained 
elusive (Andersson et al, 1996; Ziegler, 1997). Most authors have dismissed the importance of 
prior studies purporting to show evidence of a human homologue of MMTV because of the 
presence of numerous human endogenous retroviruses (HERVs) (Larsson et ai, 1994; Li et al, 
5 1 996; Lower et al , 1 996; Meese et al , 1996; Ono, 1986; Patience et al , 1 996; Faff et al , 1 992). 
There are about 50,000 HERVs or HERV -related sequences in the human genome, some of 
which have been shown to have up to 60% homology to MMTV. In this regard, it is important 
to note that seroreactivity to HERV-K10, to this point the HERV considered to be most closely 
related to MMTV, cannot account for MMTV -reactive antibodies present in the sera of breast 

10 cancer patients and the smaller number of healthy individuals (Vogetseder et al, 1995). 
Furthermore, we believe that the presence of these MMTV-related sequences is precisely the 
reason that human homologues of MMTV have not previously been demonstrated conclusively 
by molecular techniques. The presence of these related, but distinct, sequences could have 
obscured the detection of more closely related sequences by prior investigators who used less 

is sensitive techniques, such as Southern blotting. 

Only recently have sequences with relatively high homology (>90%) to those of MMTV 
been isolated from human BC tissue (Wang et al, 1995, 1998; ). Sequences 95 - 99% similar to 
MMTV env were amplified by PCR in 121 (38.5%) of 314 unselected breast cancer tumor 
samples. It is pertinent to note that the MMTV-like sequences were detected in only 2 (1.8%) of 

20 1 07 breast specimens from reduction mammoplasties and in 0/80 samples from normal tissues or 
non-breast tumors. The MMTV-^v like RNA was expressed (as determined by RT-PCR) in 
66% of DNA PCR positive breast tumors (Wang et al, 1998). A complete 9.9 kb provirus with 
94% similarity to MMTV was detected in 2 breast tumors. FISH (fluorescence in situ 
hybridization) revealed integration at several sites in DNA derived from BC tumors, but not 

25 normal breast cells (Wang et al., 1999 ACR mtg. abstracts #2933, 2944). Wang et al suggested 
the existence of a human mammary tumor virus (HMTV) that is spread by the exogenous route 
of infection (horizontal transmission). Attempts by these and other investigators to amplify other 
regions of MMTV-related viruses, from the genomic DNA or cDNA of subjects who did not 
have BC, yielded HERV sequences (such as HERV-K10) with only about 60% homology to 

30 MMTV. Thus, BC tissues are the only tissues in which sequences that are highly similar to those 
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of MMTV have been heretofore found. Consequently normal breast tissue and other tissues 
appeared to be negative for the expression of viruses with high homology to MMTV. 

MMTV can be transmitted horizontally or vertically 

The retrovirus replication cycle is characterized by conversion of the single-stranded 
5 RNA viral genome into doubled-stranded proviral DNA by the multiple enzymatic activities of 
the virion-associated reverse transciptase (RT). As is the case with other retroviruses, 
integration of MMTV proviral DNA into the genome of host cells is required for expression of 
viral proteins and production of infectious progeny. In both infected mouse mammary glands as 
well as heterologous cells MMTV proviral DNA is integrated into a large number of apparently 

10 random sites. Integration of MMTV proviruses containing transcriptionally active LTR near 
some cellular genes (proto-oncogenes), such as Int 9 Wnt, and Fgf can result in over-expression of 
these genes, cellular transformation and clonal expansion of the tumor cells (Varmus, 1985; 
Shackleford et al. 9 1993; Jakobovits et aL, 1986; Cohen, 1980; Breznik and Cohen, 1982). The 
long latency of MMTV-induce carcinogenesis is explained in part by the necessity for proviruses 

is to integrate into these particular sites. 

Genetic differences among viral strains of MMTV can account in part for the varying 
incidence of BC in diverse strains of mice. Mice of the C3H strain have a greater than 90% 
incidence of BC, compared to a <1% incidence ■ of BC in BALB/c mice. BALB/c mice foster- 
nursed on C3H females have a high incidence of BC which suggested that the turn ori genie 

20 MMTV of C3H can be horizontally transmitted in the milk (Bittner, 1936). Conversely, when 
C3H mice are foster nursed on a BALB/c female the incidence of BC is significantly lower (22- 
55%), but not as low as low-incidence mouse strains. This latter observation underscores the 
importance of the horizontally-transmitted milk-born virus in the high incidence strains, but also 
indicates the substantial differences in tumorigenic potential among endogenous MMTV 

25 proviruses. Proviruses of various mice with high and low tumor incidence could be 
distinguished by differences in solution hybridization kinetics and restriction endonuclease 
digestion patterns (Cohen et al. Cell, 1979; Cohen and Varmus, 1979, 1980; Traina-Dorge and 
Cohen, 1983; Breznik et al. 1984). Using restriction enzymes that differentiate between 
hypomethylated and methylated DNA, proviruses of milk-borne MMTV were also shown to be 

30 hypomethylated, whereas most endogenous proviruses contain abundant 5-methylcytosine 
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(Cohen, ,980; Breznik and Cohen, ,982; Breznik „ „/., I984) . Because hypomethy|ation „ 
assoc.ated with increased gene expression, rhis observation could explain the importance of 
honzontally-transmmed milk-borne MMTV in mouse strains with a high incidence of BC 
Spectfic hypomethylation of an endogenous MMTV provims was associated with expression of 
> a 1 .6 kb transcript of the lacing mammary g ,a„d of the BALB/c tnouse (Traina-Dorge e, at., 
1985). 

In other studies, endogenous MMTV provides have been shown ,o segregate as stable 
genettc units during inbteeding and that cenain of these endogenous provtntses are 
transcrtptional.y active (Cohen e, a/., C*//,,979; Cohen attd Varntus, 1979, !980; Traina „ a, 
. 1981; Traina-Dorge and Cohen, ,983; Traina-Dorge „ a,., ,985; Varmus e, a, 1978)' 
Recombinant inbred (RI) strains of tnice have been used to define MMTV proving con.posi.ion 
and chromosomal ,oca,io„s (Traina e, a,., ,981; Traina-Dorge and Cohen, ,983). Ri straitS 
were developed by crossing mice front lwo highJy inbred , jnes randomly ^ ^ 

generatton brothers and sisters and maintaining each as separa,e lines. Many genetic loci were 
mapped ,o specific chromosomes in these strains. By analyzing the segtegation patterns of 
vartous MMTV provides with these genetic markers by restriction endonuclease analysis and 
Southern biorting, i, was possible ,o detennine the chrom OS oma, locations of a number of 
MMTV provides in rhese snains and ,„ establish that these points segregate by the rule of 
Mendeltan inheritance. These and other analyses defined ,0 distinct MMTV units (provides) 
m these animals some m lengm and some treated. The ratios of inheritance for most of the 
MMTV provimses were consistent with simple single-gene inheritance, though thzee of the 
MMTV units demonstrated some variance. Most noticeable was the presence of a uni, 
contnbuted by one parent tha, was present in 23 of the 26 RJ strains analyzed. Specific proviral 
untts were identified tha, were transcriptional active and associated with increased tumor 
production (Traina-Dorge „ a,., 1985). taportantly, cellulaz genes were identified that were 
stgntficantly associated with virus expression showing the involvement of host genetics in 
disease progression (Traina-Dorge et al. , 1 985). 

Several lines of evidence indicate that the endogenous MMTV provides have integrated 
relanvely recently in the germline of various strains of mice (Cohen and Varmus, 1979; Varmus 
«' «/.. 1978). If MMTV evolved from elements present in a progenitor of Mus musculus (the 
laboratory m0 use), it would be expected that all individual mice wou.d have similar MMTV 
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proviruses. However, both laboratory mice and wild mice trapped at several locations having 
quite variable numbers and distributions of germline integration sites for their MMTV proviruses 
(Cohen and Varmus, 1979). The most striking finding among these results were the presence of 
some wild caught animals that contained no endogenous MMTV proviruses. The most likely 
5 interpretation of these results is that the endogenous MMTV proviruses arose by multiple 
independent integrations into the DNA of germinal cells after speciation of the genus Mus, rather 
than arising from genetic elements present in the evolutionary progenitors of mice. 

SUMMARY OF THE INVENTION 

The present invention provides for recombinant DNA molecules derived from one or 
io more mammary tumor viruses which are endogenous retroviruses with homology to the 
sequences of MMTV. Specifically, the sequences of the instant invention have at least 99% 
identity with all or a portion of SEQ ID NOs: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29 or at least 
92% identity with SEQ ID NO: 10. 

Additionally, the present invention provides for the RNA molecules produced by 
15 transcription of the DNA described above. Furthermore it provides for the polypeptides resulting 
from the in-frame translation of RNA which has been transcribed from the MTV DNA of the 
instant invention. According to the present invention, the referenced DNA sequences may be 
derived from any suitable source, such sources may include, but are not limited to, human, cat, 
and rhesus macaque. 

20 The present invention also provides for a recombinant DNA plasmid (a vector) which 

comprises mammary tumor virus (MTV) DNA the sequences of SEQ ID NOs: 2, 3, 4, 5, 6, 7, 8, 
10, 21, 23, 25, 27, or 29, or sequences which have at least 99% identity thereto. That is said 
DNA sequence is incorporated in the vector. 

In one embodiment of the present invention the vectors further comprise a heterologous 

25 promoter operably linked to the MTV sequence (i.e. joined in the proper reading frame so as to 
be capable of producing functional MTV RNA and/or protein in vivo and in vitro). In one aspect 
of this embodiment of the invention the vector, which comprises the MTV DNA, is capable of 
episomal replication or chromosomal integration in at least one of the following cell types: 
bacterial cells, yeast cells, insect cells , avian, cells, and mammalian cells (this list of cell types is 

30 representative and should not be considered exhaustive). In another aspect, of this embodiment 
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of the invention, the heterologous promoter provides for the expression of the MTV DNA 
sequence in one or more cell types. Cell types considered useful as part of this aspect of the 
invention include, but are not limited to the following: bacterial cells, yeast cells, insect cells , 
avian, cells, and mammalian cells. 

According to another embodiment of the instant invention the MTV DNA sequences 
described above may be used to provide a method of detecting the presence of MTV DNA or 
RNA in a sample (of biological origin, such as serum, or otherwise). 

Another embodiment of the instant invention provides for a method of determining 
whether a sample contains antibodies which recognize proteins derived from the MTV DNA 
sequences described above (e.g. polypeptide sequences derived from transcription and translation 
of SEQ ID NOs: : 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29). As a corollary to this aspect, the 
instant invention also provides for antibodies which specifically detect one or more ,of the 
polypeptides of the instant invention. 

Another embodiment of the invention provides for diagnostic kits useful for detecting 
is DNA, RNA, or polypeptides, from a mammary tumor virus, in a biological or other type of 
sample. 

Another embodiment of the instant invention provides for methods of attenuating or 
eliminating the activity of MTV in its host animal. Various aspects of this embodiment provide 
for pharmaceutical compositions comprising substances which disrupt the activity of the MTV 
20 reverse transcriptase, protease, or integrase enzymes. Other aspects of this embodiment provide 
for pharmaceutical compositions capable of eliciting an immune response in a host animal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
25 reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

Figure 1 shows the results of amplification of sequences related to MMTV from human 
breast cancer tissue. Panel A: DNA was extracted from human breast tumors (kindly provided 
by Michael Press, M.D., USC, Los Angeles or Derrick Beech M.D., TMC/UT Memphis) and 
30 PCR was performed using primers specific for the human MMTV e«v-related gene. PCR 
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products were transferred to nitrocellulose by blotting and MMTV-related products were 
detected by hybridization to a 1.8 kb MMTV env probe, lane a: nonradioactive markers, not 
shown; lanes b-d, f-h, j: breast tumor DNA; lanes e and i: no DNA; lane k: water control; lane 1 
positive control: MMTV env fragment cloned in pBluescript™. Panel B: As a test for the 
5 integrity of the DNA from the clinical samples we amplified HERV-3 proviral DNA, a single 
copy human endogenous retrovirus using PCR conditions developed by Griffiths et al (1997). 
Ethidium bromide detection (markers are visible in lane a). Same samples as Panel A except 
lane 1: positive control, HERV3 pol fragment cloned in pBluescript™ (Stratagene). Visible 
bands were present in lanes c and g, but do not copy well. 
io Figure2 shows an example of the amplification of sequences related to MMTV from the 

blood of healthy controls. Panel A: DNA was extracted from whole blood of healthy control 
subjects and PCR was performed using primers specific for the human MMTV e*v-related gene 
and PCR products were detected by Southern hybridization, lane a: markers; lanes b-k: DNA 
from whole blood of healthy controls; lane 1: water control; lane m positive control: MMTV env 

is fragment in pBluescript™ (Stratagene). Panel B: PCR amplification of HERV-3. Ethidium 
bromide detection (markers are visible in lane a). Same samples as Panel A except lane m: 
positive control, HERV3 pol fragment in pBluescript™. 

Figure 3 shows an example of the detection of HMTV mRNA by ribonuclease protection 
assay. RNA was extracted from three HMTV PCR positive breast cancer tumors or from yeast 

20 cells and hybridized to either 1 89 base probe specific for HMTV or a 245 base probe for P-actin. 
Samples were then digested with RNAse A/Tl. The fragments of the labeled probes protected by 
hybridized RNA were visualized and analyzed following separation on a denaturing 
polyacrylamide gel. Two of the three tumor samples gave protected fragments of the expected 
size with the HMTV probe (arrow) (lanes 1-3), whereas all three tumor RNAs gave protected 

25 fragments with the P-actin probe (lanes (5-7). As expected neither probe was specifically 
protected by yeast RNA (lanes 4, 8). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides for isolated DNA, RNA, and polypeptides derived from 
mammary tumor virus (MTV) which is a virus with high sequence homology to mouse 
30 mammary tumor virus. The present invention further provides for diagnostic methods of using 
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these macromolecules. Additionally the present invention provides pharmaceutical reagent 
compositions and diagnostic kits comprising MTV DNA, RNA, and/or polypeptide, which can 
be used according to the disclosed methods. 

Various embodiments of the instant invention provide for MTV nucleic acid 
corresponding to at least one of the following: DNA sequences with at least 99% identity with 
one of the following: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5 5 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO: 5; at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0.. 

Preferably the sequences have greater than 99% identity with greater than 200, 250, 300, 
350 or 400 nucleotides SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29. In another 
preferred embodiment the sequences have greater than 93%, 94%, 95%, 96%, 97%, 98% or 99% 
identity with SEQ ID NO: 10. Even more preferably these nucleic acids are derived from human, 
cat, or rhesus macaque MTV's. Most preferably the DNA is identical with all or part of SEQ ID 
NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29. 

In one aspect of this embodiment of the invention the MTV DNA sequences described 
above are incorporated in a vector. In various related aspects the invention the MTV sequences 
are under the transcriptional control of an heterologous promoter. The vectors contemplated as 
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bang useful according ,o the instant invention are capable of expressing the MTV DNA 
sequences in at least one of the fo.lowing cell types: insect cells, bactenal cells, avian cells 
yeast cells, or mammalian cells. Furthermore the vectors of this aspect of the invention are also 
capable of episomai replication and/or chromosomal integration in at least one of the cell types 
5 listed. JF 

In another aspect of , his embodiment the DNA sequences comprise one or more detection 
mo.ehes. Detection moieties con>e mpl ated as being sui,able for the insum, invention include 
hu.are no, limited ,o fluorescent dyes, and radioactive isotopes of phosphorous, sulfur, oxygen' 
carbon, or hydrogen. ' 

In yet another aspect of this embodiment of me invention ,he isolated DNA molecule is 
suspended or dissolved in a diluent compatible with the invention. Sui,able diluents do no. 
interfere with ,he use of the DNA according to the various embodiments of the instant invention 
D,luen,s usefitl according to this embodiment of the invention are we,, known ,o those skiUed in 
the a„. They include bu. are no, limited ,o buffered aqueous solutions. These may be buffered 
, w,m any compound compatible wim ,he presen, invention. Exemplary buffering agen,s include 
Phosphate buffers and buffers comprising tiishydroxyaminomemane (Tns). Such buffered 
aqueous solutions may farther comprise any other compound, such as sodium chloride, which 
wl. no, ,„,erfcre with the operation of ,he ins*n, invention. According ,o mis aspec, of me 
■ns,an, invention me MTV DNA may be presen, in any suiuble concentration. Preferably ,he 
concenlrauon if from about 0.1 ng/ul to 100 pg/pl. 

0,her embodiment of the insmn, invention provide for RNA transcripts and/or 
polypeptides encoded by the nucleic acids described above. ,„ various aspecs of mis 
embod.men, these RNA transcripts or polypeptides are encoded by any of me DNA sequences 
desenbed above. These RNA ti^scripts and polypeptides are homologues of RNA .ranscriprs 
and polypeptides encoded by me MMTV e„v, gag , or gene , ,„ one 
embodunen, me RNA nanscrip b are encode, by the DNA sequences described above. Such 
RNA transcripts have a sequence identical ,o me disclosed DNA sequence except ,ha, ,he RNA 
comams uridine ins,ead of thymidine. In anoUter preferred aspee, of tins embodiment me 
purged po,ypeptide corresponds in sequence w it h al, or pan of an e„v, gag , or pro, pro,ei„ 
product from a human, cat, or rhesus macaque mammary .umor virus. I, is preferred ma, me 
pepudes be purified po.ypeptides. In a preferred aspec, of ,he presen, invention me purified 
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polypeptides are comprised of all or part of the sequence resulting from the in-frame 
transcription and translation of any of the MTV DNA sequences described above. In an even 
more preferred aspect of the present invention the polypeptides correspond in sequence to the 
product of an in-frame translation of SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29. 
5 Most preferably the polypeptide sequences of the instant invention correspond to all or at least 
80 amino acids of SEQ ID NO's 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, or 28. The term 
"derived from MTV DNA" is meant to convey that the RNA or peptide corresponds in sequence 
to the RNA or peptide produced respectively by the transcription or transcription and in-frame 
translation of MTV DNA. 

io By "purified polypeptides" it is meant that the majority (greater than 50%) of the 

polypeptides in the sample are the MTV polypeptides. Preferably, the MTV polypeptides 
constitute greater than 70% of the polypeptide in the purified polypeptide sample. More 
preferably the MTV polypeptide constitutes greater than 90% of the polypeptide in the sample. 
Even more preferably the MTV polypeptide constitutes greater than 95% of the polypeptide in 

is the sample. Additionally, the term "purified polypeptide" indicates that the sample does not 
contain substances which interfere with the operation of the instant invention. Purified 
polypeptides obtained in accordance with the current invention may be from any suitable source 
which is compatible with the instant invention. 

Another embodiment of the instant invention provides for an antibody against an MTV 

20 polypeptide as described above. Antibodies according to this aspect of the invention are 
generated using one or more of the polypeptides of the instant invention as the antigenic agent. 
Such antibodies may be either monoclonal or polyclonal in nature. Preferably the antibodies are 
monoclonal. Once the MTV has been provided the antibodies of this aspect of the invention may 
be prepared according to methods well known to those skilled in the art. For example, 

25 polyclonal antibodies are commonly produced by injecting the antigenic agent (in the presence 
of an immune response enhancing agent such as complete Freunds adjuvant) into an animal such 
as a sheep, bovine, equine, goat, or rabbit. 

Monoclonal antibodies according to this aspect of the present invention can be prepared 
by hybridoma fusion techniques or by techniques that utilize Epstein Ban Virus (EBV) - 

30 immortalization technologies (to produce human mAbs), such as are well known by those of skill 
in the art, modified as described herein. In the method of the invention, these techniques involve 
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the injection of an immunogen, in this case, purified MTV polypeptide so as to elicit a desired 
immune response in that animal (i.e., production of antibodies). The experimental animal, (e.g., a 
mouse) is given repeated injections (boosts) of the same immortalized cell line. In a final step, 
the animal is given an injection of primary cells of the chosen cell type. 
5 In the illustrative example herein for the production of agonist monoclonal antibodies to 

megakaryocyte cells, a CMK cell preparation and a CMS cell preparation were used as the first 
immunogens; however, other immortalized megakaryocyte cells, such as Mo7e or DAMI cells, 
could have been used. Other monoclonal antibodies analogous to the agonist antibody of the 
invention BAH-1, which specifically recognizes the C-Mpl receptor, can be generated using 

io membrane bound c-Mpl receptor protein as the immunogen. To generate agonist monoclonal 
antibodies against other cell types, other cells of hemopoietic lineage are chosen, e.g., stem cells, 
B cells or T cells. In the first immunization step stem cells can be represented, e.g., by the 
immortalized cell line CTS; B cells by the immortalized cell lines ARH-77, SB or Nal-6; and T 
cells by the immortalized cell lines Jurkat or H9. 

is After a sufficient time, the animal is sacrificed and somatic antibody-producing cells may 

be derived from the lymph nodes, spleens and peripheral blood of primed animals. Spleen cells 
are preferred. Mouse lymphocytes give a higher percentage of stable fusions with the mouse 
myelomas described below. The use of rat, rabbit, frog, sheep and other mammalian somatic 
cells is also possible. The spleen cell chromosomes encoding desired immunoglobulins are 

20 immortalized by fusing the spleen cells with myeloma cells, generally in the presence of a fusing 
agent such as polyethylene glycol (PEG). Any of a number of myeloma cell lines may be used as 
a fusion partner according to standard techniques; for example, the P3-NSl/l-Ag4-l, P3-x63- 
Ag8.653 or Sp2/0-Agl4 myeloma lines. These myeloma lines are available from the American 
Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 201 10-2209. 

25 The resulting cells, which include the desired hybridomas, are then grown in a selective 

medium, such as HAT medium, in which unfused parental myeloma or lymphocyte cells 
eventually die. Only the hybridoma cells survive and can be grown under limiting dilution 
conditions to obtain isolated clones. The supernatants of the hybridomas are screened for the 
presence of antibody of the desired specificity, e.g., by immunoassay techniques such as those 

30 described herein, using the antigen that has been used for immunization. Positive clones can then 
be subcloned under limiting dilution conditions and the monoclonal antibody produced can be 
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isolated. Various conventional methods exist for isolation and purification of the monoclonal 
antibodies so as to free them from other proteins and other contaminants. Commonly used 
methods for purifying monoclonal antibodies include ammonium sulfate precipitation, ion 
exchange chromatography, and affinity chromatography. Hybridomas produced according to 
5 these methods can be propagated in vitro or in vivo (in ascites fluid) using techniques known in 
the art (see, generally, Harlow et al., Antibodies. A Laboratory Manual, Cold Spring Harbor 
Laboratory, pp. 1-726, 1988). 

Other embodiments of the instant invention provide methods for detecting DNA, RNA, 
and/or proteins from MTV in biological and/or other types of samples. 
io One aspect of this embodiment of the invention provides a method for the detection of 

MTV DNA in a sample which comprises the following steps: 

i) obtaining an sample suspected of containing one or more of the MTV DNA 
sequences described above; 

ii) carrying out a polymerase chain reaction (PCR) to amplify a DNA sequence as 
is defined in step i); and, 

iii) determining the sequence of, or otherwise characterizing, the amplicons (the PCR 
amplified DNA) produced in step ii) to determine whether or not the DNA 
sequence as defined in step i) is present in the sample. 

Another aspect of this embodiment provides a method for the detection of MTV RNA in 
20 a sample which comprises the following steps: 

i) obtaining an sample suspected of containing RNA which encoded by one or more 

of the MTV DNA sequences described above; 
iii) carrying out an RNAse protection assay (RPA); and, 

iii) analyzing the RPA results to determine whether RNA as defined in step i) is 
25 present in the sample and optionally quantitating said RNA. 

It will be recognized by those of ordinary skill that the selection of the parameters 
necessary for optimizing the steps of these methods are well within the abilities of the ordinarily 
skilled artisan. These parameters, for example PCR conditions (e.g. selection of annealing 
temperature, extension times, and primers) and DNA sequencing method, are routinely 
30 determined in labs where such molecular biological techniques are employed. 
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Another aspect of this embodiment of the invention provides for a method for analyzing a 
sample in order to determine whether the sample contains antibodies which recognize MTV 
polypeptides. This method comprises the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified MTV polypeptide; 

iii) performing western immunoblot analysis using the sample of step i) and the 
polypeptide of step ii); and, 

iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sample. 

Another aspect of this embodiment of the invention provides for a method for analyzing a 
cell culture or a tissue sample by one or more immunohistological methods in order to determine 
whether the sample contains MTV proteins. This method comprises the steps of: 

i) obtaining a sample suspected of containing mammary tumor viral proteins; 

ii) preparing the sample of step i) for immunochemical analysis; 

iii) incubating the sample of step ii) with one or a combination of two or more 
monoclonal or polyclonal antibodies specific for MTV polypeptides encoded by 
one or more of the MTV DNA sequences described above, wherein the said 
antibodies optionally have a moiety which allows for their specific detection; 

iv) washing the samples to remove antibody which is not specifically bound; 

v) processing the samples as appropriate for the selected detection method; and, 

vi) analyzing the results of step v.) to determine whether or not MTV proteins are 
present in the sample. 

It is envisioned that the immunochemical analysis may include, but is not limited to 
analysis by western blotting or enzyme-linked immunosorbant assay. It will be recognized that 
the selection of the parameters necessary for optimizing the performance of these methods are 
within the abilities of the ordinarily skilled artisan. Analysis may be performed using either 
direct immunochemistry (if the antibodies have been labeled with a detectable moiety) or by 
indirect immunochemistry. 

Other embodiments of the current invention provide for pharmaceutical compositions 
comprising MTV DNA, RNA and/or proteins described supra. These compositions may further 
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compnse a pharmaceutically acceptable excipient, carrier, or diluent and do not contain any 
b.olog.cally harmful substances. The pharmaceutical compositions of the present invention may 
be formulated by one having ordinary skill in the art. Suitable pharmaceutical formulates are 
described in Remington s Pharmaceutical Sciences which is a standard reference text in the field 
5 which is here in incorporated by reference. 

The pharmaceutical compositions may further comprise coloring or stabilizing agents 
osmonc agents, antibacterial agents, or any other substances as long as such substances do not' 
•nterfere with the function of the composition. The pharmaceutical compositions of the 
invention, can, for example, be formulated as a solution, suspension, or emulsion in association 
.o w,th a pharmaceutical^ acceptable parenteral vehicle. Examples of such vehicles are water 
sahne, Ringer's solution, dextrose solution, and 5% human albumen. Liposomes may also be' 
used. The vehicle may contain additives that maintain isotonicity (e.g., sodium chloride of* 
manmtol) and chemical stability (e.g., buffers and preservatives). It should be appreciated thar 
endotoxin contamination should be kept at a safe level, for example, less than 0.5 ng/mg protein - 
» Moreover, for human administration, preparations should meet sterility, pyrogenicity, general 
safety and purity standards as required by the United States Food and Drug Administration 
Office of Biological Standards. The formulations may be sterilized by commonly used 
techniques such as filtration. 

The phrase "pharmaceutically acceptable" refers to substances and compositions which 
20 do not produce an adverse, allergic, or otherwise untoward reaction when administered to an 
animal, or a human, as appropriate. A substance which caused produced any of these adverse 
effects would be classified as "biologically harmful" within the scope of the present invention 
Pharmaceutically acceptable substances and compositions include, but are not limited to 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
« delaying agents. Except where incompatible with the invention the use of any conventional 
ingredient is contemplated. Furthermore, supplementary active ingredients which serve some 
other pharmacologically expedient purpose can also be incorporated into the instant 
compositions. 

The methods and compositions described above are contemplated to be of great benefit in 
30 helpmg to determine whether or not MTV or MTV viral components are present in a person's 
tissues. This knowledge is would be of aid in predicting a patient's susceptibility to cancer 
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etiologically derived from MTV. The methods provide a means for determining a patients viral 
load (the amount of virus present in the person's tissues). In this respect it is envisioned that 
tremendous benefit would be achieved by adapting the methods herein described to the 
widespread screening of the general population, and in particular to all mature human females. 
5 Other compositions contemplated as part of various embodiments of the instant invention 

provide a means for stabilizing or reducing the amount of virus present in a human or other 
animal. 

The various methods described above are readily adaptable to preparing diagnostic kits 
for detecting the presence of MTV DNA, RNA, and/or polypeptides. Consequently, for the 
io clinical practice of the invention, yet other embodiments of the instant invention, which provide 
for diagnostic kits, are contemplated. Such kits are useful for the qualitative and quantitative 
analysis of a sample in order to detect, and perhaps determine the quantity of, MTV DNA, RNA, 
and/or polypeptide present therein. In one aspect of this embodiment of the invention the kit 
includes, as part of its components, one or more recombinant DNA molecules comprising one or 
is more of the MTV DNA sequences described above. Also according to this aspect of the 
invention the kit may comprise, in addition to (or in alternative to) the recombinant MTV DNA, 
one or more synthetic oligonucleotide primer pairs useful for the PCR amplification of these 
MTV DNA sequences. 

Another aspect of this embodiment of the invention provides for a kit for detecting 
20 antibodies which specifically recognize mammary tumor virus proteins. As part of its 
components this kit includes a reagent comprising one or more polypeptides encoded by the 
MTV DNA sequences described above. 

Also contemplated are kits useful for detecting the presence of MTV proteins by 
immunocytochemistry which comprise one or more antibodies (either monoclonal or polyclonal) 
25 which are specific for at least one MTV polypeptide. Optionally, these antibodies may be 
modified so as to comprise a detection moiety (e.g., a fluorescent die or a radioactive isotope, 
such as 35 S). 

Kits according to these embodiments of the invention may comprise packages, each 
containing one or more of the various reagents (typically in concentrated form) which are 
30 required to perform the respective diagnostic tests. The kits according to these embodiments of 
the invention are contemplated to be useful for detecting and/or quantifying MTV DNA, RNA, 
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and/or protein in biological (or other types of) samples. Such samples may be selected from, but 
are not limited to the following: blood plasma or serum, whole blood, urine, sputum, colonic 
effluent, cerebrospinal fluid, lymphatic fluid, bone marrow, tissue samples (such as from a 
surgical biopsy), or any other sample suspected of containing these biological molecules, 
s The kits also may further include one or more fiduciary results. As used herein a 

"fiduciary result" refers to a reference standard against which a test outcome is compared to 
gauge the results in terms of quality and/or quantity. A "fiduciary series" is a plurality of such 
references that represent points along a qualitative or a quantitative scale. Preferable in this 
regard, are kits that include a fiduciary series for interpreting results. The fiduciary may be in 
jo the form of one or more photographs or may be depicted in other ways, including written 
descriptions. 

Thus fiduciaries may be developed as part of the kits, in accordance with this aspec't^of 
the invention, to guide interpretation of results. In this regard, a RNA or polypeptide 
concentration from a biological sample may be characterized in accordance with the foregoing. 

is Sample may be taken from representative cross-section of patients at various stages of disease 
(including asymptomatic or essentially disease free) to prepare a fiduciary series. 
Characterization in this regard may benefit from hindsight, by following the actual course of 
neoplastic progression in patients as they undergo diagnosis, treatment, and follow up thereafter. 
The determination of viral load (as determined by the abundance of MTV DNA, RNA, or 

20 polypeptide), as set out above, for a variety of patients of known breast cancer status and 
eventual outcome, and the subsequent correlation of these values is of incalculable prognostic 
value. This will help provide the patient with a more accurate prognosis, and it will also aid the 
patient and oncologist in determining the best course of therapeutic treatment, when such 
treatment is necessary. 

Another embodiment of the instant invention provides for a composition which induces 
an immunological response against MTV in an animal. A specifically contemplated aspect of 
this invention is a composition which induces an immunological response to human mammary 
tumor viral protein in humans (see Example 7). It is expected that such induced immunity may 
prevent breast cancer in individuals that carry endogenous MTV, by blocking the spread of the 
30 virus to hormonally sensitive tissues. 



25 
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Methods for producing an immune response to viral DNA and/or RNA in animals are 
also known, see for example U.S. patent numbers 5,990,091 and 6,004,799 which are herein 
incorporated by reference. Thus compositions which elicit an immune response to MTV DNA 
and/or RNA are also contemplated as aspect of this embodiment of the invention. 
5 Other embodiments provide for therapeutic compositions which decrease the activity and 

presence of human mammary tumor virus. A common feature of all retroviruses is the presence 
of three enzymes involved in various stages of the viral replication cycle, the reverse 
transcriptase (RT), protease (PR), and integrase (IN). Various pharmaceuticals have been 
developed which target the RT and PR of human immunodeficiency virus (HIV), a retrovirus 

io distantly related to MTV. Furthermore, chemicals which selectively inhibit HIV IN have been 
identified and prototype drugs targeting this enzyme are under development (Hong et al y 1998; 
Mathe, 1999; Robinson, 1998; Singh et aL, 2000). According to one aspect of the instant 
invention a pharmaceutical composition comprising one or a mixture of two or more retroviral 
inhibitors in an amount effective to inhibit the activity or the spread of MTV's of the instant 

is invention is provided. 

Identification of the drug or drugs for use in the pharmaceutical compositions of this 
embodiment can be made using techniques known to those skilled in the art. For examples of 
protease and reverse transcriptase inhibitors and methods of determining the efficacy of such 
drugs as retroviral inhibitors see U.S. Patent Nos. 5,858,738, 6,017,928, and 6,046,228, which 

20 are herein incorporated by reference. The various compositions of this embodiment of the 
invention may comprise, for example, currently known HIV integrase, protease, or reverse 
transcriptase inhibitors; alternatively, such inhibitors can be modified so as to increase their 
specificity for MTV's. Likewise, other classes of HIV inhibitors, such as peptides which are 
analogs of the transmembrane glycoprotein, can serve as pro-drugs to develop specific drugs 

25 which decrease activity and presence of MTV in humans and other species. These compositions 
are contemplated as being useful to inhibit the activity and spread of MTV in humans and other 
species which carry these viruses as endogenous genetic elements. Such drugs could be used to 
treat breast cancer patients afflicted with MTV and would be expected to ameliorate the severity 
and to reduce the recurrence of disease. In addition, such drugs could be used prophylactically to 

30 prevent the occurrence of individuals at high risk for developing the disease. 
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EXAMPLES 

The following examples m included to demonslrate prefmed embodiments ^ ^ 
mvcntion. I, should be ap P recia,ed by those of slcitl in me an thal the lechniquK disc|ose<J .„ ^ 
examples which foUow represen, techniques discover by the inventor to function wel, in the 
. practice of the invention, and thus can be considered to constitute preferred modes for its 
pracce. However, those of skill in the an should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a hke or stmilar result without departing from the spirit and scope of the invention. 

EXAMPLE 1 : MMTV-related sequences in humans 

Sequences which are highly similar (>95%, to the MMTV env ge „e were amplified by 
PCR from human DNA samples, including subsets of borh BC (breas, cancer) tissue and norPBC 
rtssues. The MMTV-related sequences by were found by PCR and a sensitive blotting technique 
not only in breas, tumors (see Figure 1), bu, also in the blood of a subset of healthy contiols (see 
F.gure 2), and systemic lupus erythematosus (SLE) parients without breas, cancer. Our results 
dtfler from those of Wang and coworkers (1995) who, wi,h few exceptions, were able ,o de,ec, 
MMTV-hke sequences only in breas, .umors. The sequences from human DNA were distinc, 
from ,he MMTV sequences used as controls in these PCR reactions indicating ma, our result are 
no, stmply due ,o contamination. A ribonuclease protection assay was used ,o confirm ,hese 
results using a non-PCR based technique ,o de,ermine ,ha, the majori.y of me PCR positive BC 
tissues, bu, none of ,he PCR negative tissues, expressed this sequence a, ,he mRNA leve, Many 
of ,he products from these PCR reactions have been sequenced. Analysis of these sequences 
provdes further stiong evidence ,ha, PCR combination is an unlikely explanation for me 
observed resuhs. MMTV env-Hke sequences from differen, individual derived in ,he same PCR 
nrn were distinct from each „,her. This resul, indicates the ,ack of an ubiquitous PCR 
contamtnan, ma, would have produced a more co„sis,en, sequence Uta, should have been 
■denttcal (or nearly so) in me various reaction tubes. Furthermore DNA of individual subjects 
produced mtemally consign, MMTV env-like sequences from PCR run ,o PCR run The 
vanauons within me MMTV-re.ated sequences from a given patient may represen, a low number 
of Taq errors, bu, are also suggestive of variations expected of a replicating retrovirus (reverse 
trasenptase errors). 
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Example 2: RNase Protection Assay determination of MTV RNA levels 

Ribonuclease protection assay (RPA) is a quantitative assay which is frequently used by 
those of ordinary skill in the art to determine the of levels of specific RNA species without PCR 
amplification. Briefly, it is performed as follows for the transcripts of the present invention: a 
5 probe of uniform length is synthesized by in vitro transcription of a cloned template and labeled 
to high specific activity.. t353 S-labeled riboprobes were prepared from plasmids containing the 
cloned MTV fragments in the 150 to 400 bp size range. The probe was then hybridized to test 
RNA, and RNA fragments that remain unhybridized are not protected from digestion by RNAse 
A/Tl. The fragments of the labeled riboprobe protected by hybridized RNA were visualized and 

10 analyzed following separation on a denaturing polyacrylamide gel. Levels of MTV-related 
mRNA are compared in each sample to levels of mRNA produced by the P-actin gene, a 
"housekeeping" gene to ensure integrity of the RNA sample (i.e., that the RNA has not been 
degraded). RPA detected RNA in 2 out of 3 PCR positive breast tumors (see Figure 3). RPA is 
10-15 times more sensitive than "northern" analysis for detection of rare messenger RNA. RPA 

15 analysis is performed directly on total RNA, without any prior manipulation, which can 
introduce errors in the quantitative analysis. 

Example 3: RT-PCR determination of MTV RNA levels 

MTV mRNAs may also be detected using real time RT-PCR (reverse transcriptase linked 
polymerase chain reaction). For example the I-Cycler (Biorad), with a fluoroscopic detection 
20 facility is capable of determining the real time kinetics of the PCR amplification product by 
quantifying the PCR product in the log-linear phase of the PCR reaction. Using this method, a 
real-time thermocycler can reliably compare miniscule amounts of known nucleotide template in 
a reproducible fashion. 

Example 4: Detection of anti-MTV antibodies in blood by Western blot analysis 

25 The InsectSelect™ System (Invitrogen) which allows stable production of recombinant 

proteins in insect cells in a manner similar to well-characterized baculovirus expression systems 
may be used to generate recombinant MTV proteins. This is a virus-free system which allows 
creation of stable cell lines that continuously produce high-quality protein. Stable cell lines 
generated in about 9 days may be used for continuous, long-term production of recombinant 

30 proteins. The InsectSelect™ expression system is based on a plasmid vector which carries an 
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antibiotic resistance gene for selecting stably expressing insect cell lines. This expression vector 
uses the immediate early promoter, OpIE2, from the Douglas Fir Tussoc moth OpMNPV 
baculovirus for gene expression. OpIE2 is a strong transcriptional promoter in lepidopterin (Sf9, 
5121, High Five™ (Invitrogen)) as well as mosquito and dipterin cell lines. The expression 
5 vector pIZ/V5-His (Invitrogen) has a multiple cloning site and several features that simplify 
production and analysis of recombinant proteins in insect cells. It includes a Zeocin-resistance 
gene that allows for rapid selection of stably transfected cells, a C-terminal tag encoding the V5 
epitope, and a C-terminal polyhistidine (6xHis) sequence. These latter features facilitate rapid 
protein detection with anti-V5 antibodies (Invitrogen) and protein purification with resins that 
io bind the polyhistidine. Alternative eukaryotic expression systems for MMTV related proteins 
are an in vitro rabbit reticulocyte lysate system (Promega) with in vitro transcribed and capped 
mRNA or the Sindbis virus Expression system (Invitrogen). r ^ 
MMTV-related genes may also be cloned into a pGEX vector (Pharmacia) and expressed 
in bacteria to make recombinant fusion protein with glutathione S-transferase (GST) protein for 
is immunoblot studies. This system has several advantages compared to other methodologies of 
protein expression, not the least of which is the ease of production, isolation, and purification of 
the recombinant protein. Fusion proteins generated with the GST protein typically remain 
soluble permitting the recovery from cell lysates. Denaturing conditions are not required during 
purification and therefore the antigenic and enzyme properties of the protein are often 
20 maintained. Moreover, the GST fusion protein may be efficiently and rapidly purified by 
immobilizing the GST on glutathione coated beads or columns and eluted with reduced 
glutathione. 

To perform a western immunoblot, MTV proteins are dissolved in lysis buffer (0.25 M 
Tris Base, pH 6.8, containing 4% sodium dodecyl sulfate, 10% dithiothreitol, 20% glycerol, and 

25 0.01% w/v bromophenol blue), heated to 100°C for 3 min and subjected to sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) on a 10% polyacrylamide gel. These 
proteins may then be transferred electrophoretically to a nitrocellulose membrane (Protran™; 
Schleicher & Schuell) in Tris-glycine (pH 8.3) buffer with 20% methanol. The blots may be 
incubated overnight in blocking buffer (0.02 M Tris, 0.1 M NaCl, heat inactivated goat serum, 

30 0.01% thimerosal, and 5% nonfat dry milk) with serum or plasma from the human or animal 
being tested (1:100 dilution or optimal dilution). Incubation with secondary antibodies, 
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biotinylated anti-human (or other species) IgG goat antibodies diluted 1:500 or 1:1,000 in 
blocking buffer, and with avidin-horseradish peroxidase may be performed at room temperature 
for 2 hours. The immunoblots may be developed with 7.8 mM 4-chloro- 1 -napthol and 0.03% 
hydrogen peroxide. Bands corresponding to MTV proteins are quantitated by scanning and 
5 processed with image analysis software (NIH Image). 

Example 5: Detection of anti-MTV antibodies by enzyme linked immunoassay 

Antibodies which specifically bind to MTV protein(s) may be detected using an enzyme- 
linked immunoassay with MTV protein or proteins as the target for the antibodies. In this 
technique MTV proteins produced in an expression system, such as the baculovirus/insect cell 

io clutter system or purified from virus preparations may be bound to the bottom of wells in a 
multiwell plastic microtiter plate. Serum or plasma from humans or other species is diluted to an 
empirically determined optimum dilution and incubated from 1 hour to overnight at about 25 °C 
(room temperature). The wells are then washed three times in a saline solution containing 
Tween® 20 (Aldrich) or NP-40 (currently available as Igepal™ CA-630, Sigma) detergents using 

is an automated plate washer. Antibodies bound to the MTV proteins may then be detected by 
reacting the wells sequentially with buffers containing biotinylated goat anti-human 
immunoglobulin, followed by avidin coupled to horseradish peroxidase, and finally 3,3', 5,5'- 
tetramethylbenzidine a substrate for horseradish peroxidase which produces a colored reaction 
product. Between each step the well is typically washed three times in a saline solution 

20 containing Tween® 20 or NP-40 detergents using an automated plate washer. The colored 
reaction produced in each well is quantitated using a spectrophotometer plate reader. The 
amount of colored reaction product is proportional to the amount of antibodies to MTV present 
in the original sample. 

Example 6: Detection of MTV retroviral proteins in situ using immunohistochemistry 

25 MTV retroviral proteins may be detected in tissue samples by immunohistochemical 

assays. In this technique MTV proteins produced in an expression system, such as the 
baculovirus/insect cell clutter system or purified from virus preparations may be used to generate 
monoclonal or polyclonal antibodies, by methods well known to those skilled in the art. These 
antibodies may then be labeled with a detection moiety such as biotin, fluorescein, horseradish 

30 peroxidase or other labels used for this purpose. The labeled antibody preparations are then 
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diluted to an empirically determined optimum dilution and incubated with fixed or frozen thin 
sections or cell preparations from samples to be tested for the presence of MTV proteins. The 
samples are processed, as dictated by the type of label used, and the binding of the antibody to 
MTV retroviral proteins may be detected by microscopy, autoradiography, flow cytometry, etc., 
again, as dictated by the identity of the labeling moiety. 

Example 7: Preparation of a pharmaceutical composition capable of eliciting an 

immunological response, against human mammary tumor virus, in humans. 
A pharmaceutical composition to be used to induce an immunological response an in 
humans or animals to MTV protein may be developed as follows. MTV proteins produced in an 
expression system, such as the baculovirus/insect cell culture system described above or purified 
from virus preparations may be extensively purified using column chromatography and/or any 
other methodologies commonly employed by skilled artisans. The purified MTV proteins may 
then be mixed or emulsified with a suitable adjuvant preparation (currently alum is the only 
adjuvant approved for use in humans) to produce a vaccine. This immunogenic composition 
may then be injected subcutaneously or intradermally into the target animal. 

All of the composition and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
compositions and methods and in the steps or in the sequence of steps of the method described 
herein without departing from the concept, spirit and scope of the invention. More specifically, 
it will be apparent that certain agents which are both chemically and physiologically related may 
be substituted for the agents described herein while the same or similar results would be 
achieved. All such similar substitutes and modifications apparent to those skilled in the art are 
deemed to be within the spirit, scope and concept of the invention as defined by the appended 
claims. 
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CLAIMS: 

1. An isolated DNA molecule comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

5 b) at least 99% identity with one or more of the following: at least 340 contiguous bases 

of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 

10 least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 

nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21 , at least 1 70 contiguous nucleotides of SEQ ID 

15 NO:23, at least 100 contiguous nucleotides of the sequence represented by 

nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0. 

20 2. The isolated DNA molecule of claim 1 which comprises SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 
10, 21, 23, 25, 27, or 29. 

3. The isolated DNA molecule of claim 1 wherein the DNA sequence is derived from 
human, rhesus macaque, or cat. 

4. The isolated DNA molecule of claim 1, further comprising a detection moiety. 

25 5. The isolated DNA molecule of claim 1, which is suspended or dissolved in a diluent. 

6. The isolated DNA molecule of claim 5 wherein the diluent is a buffered aqueous 
solution. 
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The isolated DNA molecule of claim 5 wherein the DNA is present at a concentration of 
from about 0.1 ng/ul to 100 ug/nl. 

A purified polypeptide encoded by a DNA molecule comprising a DNA sequence that 
has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1 -360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous '- : 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7 5 at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 2o'-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the • 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10. 

The purified polypeptide of claim 8 which is SEQ ID NO: 12, 13, 14 15 16 17 18 20 
22, 24, 26, 28, or 30. 

The purified polypeptide of claim 8 which is present in a pharmaceutical composition 
An annbody which recognizes a polypeptide encoded by a DNA molecule comprising a 
DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21 , 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
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nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO: 8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0; 

wherein said antibody was generated using said polypeptide. 

12. The antibody of claim 1 1 which is a monoclonal antibody. 

13. An RNA corresponding to a DNA molecule a DNA molecule comprising a DNA 
sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 1 50 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
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sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0. 

The isolated DNA molecule of claim 1 which is incorporated in a vector; 
wherein said DNA sequence is under the transcriptional control of a heterologous 
promoter. 

The vec.or of claim ,4 which is capable of expressing the DNA sequence in a, leas, one 
•f the following cell types: insec, cells, bacterial cells, avian cells, yeas, cells, or 
mammalian cells. 

The vector of claim 14 wherein said vector is capable of episomal replication or 
chromosomal integration in at least one of the following cel. types: insect cells, bacterial 
cells, avian cells, yeast cells, or mammalian cells. 

A method for the detecting the presence of a mammary tumor viral DNA comprising the 
steps of: 

i) obtaining a biological sample suspected of containing DNA which comprises a 
sequence molecule a DNA molecule comprising a DNA sequence that has- 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21 , 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4 at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides' 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 170 
contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
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contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0; 

ii) carrying out a polymerase chain reaction to amplify the suspected DNA sequence 
as defined in step i); and, 

iii) determining the sequence of, or otherwise characterizing, the amplicons produced 
in step ii) to determine whether or not the DNA sequence as defined in step i) is 
present in the sample. 

The method of claim 17 wherein the biological sample is obtained from human, rhesus 
macaque, or cat. 

A method of detecting the presence of antibodies which recognize one or more mammary 
tumor virus polypeptides, the method comprising the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified polypeptide encoded by a DNA molecule 
comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 2 1 , 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 
contiguous bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the 
sequence represented by nucleotides 1 -380 of SEQ ID NO:3, at least 1 40 
contiguous nucleotides of the sequence represented by nucleotides 1-400 of 
SEQ ID NO:4, at least 130 contiguous nucleotides of the sequence represented 
by nucleotides 1-360 of SEQ ID NO:5, at least 130 contiguous nucleotides of 
SEQ ID NO:6, at least 140 contiguous nucleotides of the sequence represented 
by nucleotides 1-380 of SEQ ID NO:7, at least 210 contiguous nucleotides of 
the sequence represented by nucleotides 20-462 of SEQ ID NO:8, at least 160 
contiguous nucleotides of the sequence represented by nucleotides 97-462 of 
SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID NO:23, at least 
100 contiguous nucleotides of the sequence represented by nucleotides 337- 
462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the sequence 
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represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0; 
in) performing an immunochemical analysis using the sample of step i) and the 

polypeptide of step ii); and, 
iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sample. 

The method of claim 19 wherein the immunochemical analysis of step iii) is a Western 
blot analysis. 



30 



21 . The method of claim 1 9 wherein the immunochemical analysis of step iii) is 
en2^me-linked immunosorbant assay (ELIZA) analysis. 

22. The method of claim 19 wherein one or more of the polypeptides are derived from the 
env, or gag, gene and the antibodies are specific for one or more of the following 
polypeptides SEQ ID NO: 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, 28, or 30. 

The method of claim 19 wherein the sample of step ii) comprises antibodies derived from 
a human, rhesus macaque, or cat. 

A diagnostic kit for detecting DNA or RNA from a mammary tumor virus, said kit 
comprising a reagent of a DNA molecule that comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21 , 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
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nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0. 

s 25. A diagnostic kit for detecting antibodies to mammary tumor virus, where said kit 
comprises a reagent comprises one or both of the following: 
i) one or more polypeptides encoded by a DNA molecule comprising a DNA 
sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21 , 25, or 27; or 
io b) at least 99% identity with one or more of the following: at least 340 contiguous 

bases of SEQ ID NO:2, at least 1 50 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
'5 of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 

140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 170 
20 contiguous nucleotides of SEQ ID NO:23, at least 1 00 contiguous nucleotides of 

the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO: 1 0; or 
") 3X1 antibody specific for a polypeptide according to step i). 

A method for the detection of MTV RNA in a sample which comprises the following 
steps: 

i) obtaining an sample suspected of containing RNA which encoded by one or more 
DNA sequences that have: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21 , 25, or 27; or 
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b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 1 50 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 170 
contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300.:: 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10;; 

iv) carrying out an RNAse protection assay (RPA); and, 

iii) analyzing the RPA results to determine whether RNA as defined in step i) is 
present in the sample and optionally quantitating said RNA. 
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SEQUENCE LISTING 

<110> Garry, Robert F. 
5 <120> Human Endogenous Retrovirus in Breast Cancer 
<130> TUMC:012 

<140> 
10 <141> 



15 



20 



25 



30 



<160> 30 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 461 
<212> DNA 

<213> Mouse mammary tumor virus 
<400> 1 

tcccttccct cgcctagtgt agatcagtca 
ggaaattata ctccccctgt caataaagag 
gaacctacat ggttctggga aaattctcct 
ctagttcccc atacagaatt gtttcgctta 
aggccaggat ttcaagaaca tgacatgatt 
gccatattat taggattacc tcagctaata 
atttcctgtt cttcttgtag attgactaat 
gcgatcatag tcaagaggcc gccatacgtg 



gatcagatta 
gttcatcgat 
aaggatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagatatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 12 0 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 
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40 



45 



<210> 2 
<211> 461 
<212> DNA 

<213> Human Mammary Tumor Virus 



<400> 2 

tcccttccct 

ggaaattata 

gaacctacat 

ctagttcccc 

aggccaggat 

gccatattat 

atttcctgtt 

gcgatcatag 



cgcctagtgt 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaaggc 



agatctgtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgacatgatt 
tcagctaata 
attgactaat 
gccatatgtg 



gatcagatta 
gttcatcgat 
aaggatccca 
gttgcagcct 
cctacatctg 
gatatagaaa 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagatatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 120 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



<210> 3 
<211> 461 
50 <212> DNA 

<213> Human Mammary Tumor Virus 

<400> 3 

55 ggaaattata a9a ' Ca9tCa 9 at «3"ta aaaacaaaaa ggatctattt 60 

gaacctaJt; * caataaa 9 a 9 gttcatcgat ggtatgaagc aggatgggta 120 

ctaottcc^ f 9Ctct 99? a aa attctcct aaggatccca atgatagaga ttttactgct 180 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 240 



<WO 0100829A3JA> 



WO 01/000829 



PCT/USOO/18279 



aagccaggat ttcaagaaca tgagatgatt cctacatctg cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 360 

atttcctgtt cttcttgttg attgactaat tgtttagatt cttctgccta cgactatgca 420 

gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



10 



<210> 4 

<211> 461 

<212> DNA 

<213> Human Mammary Tumor Virus 



<400> 4 

tcccttccct 

ggaaattata 

15 gaacctactt 
ctagttcccc 
aggccaggat 
gccatattat 
atttcctgtt 

20 gcgatcatag 



cgcctagtat 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaggcc 



agaacagtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgagatgatt 
tcagctaata 
attgactaat 
gccatacgtg 



aatcagatta 
gttcatcgat 
aaggatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagact 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagacatct 
cctgtgttac 
agagaggatc 
cttctgccta 
t 



ggatctactt 60 
aggatgggta 120 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



25 



<210> 5 

<211> 461 

<212> DNA 

<213> Rhesus Mammary Tumor Virus 



<400> 5 
tcccttccct 

30 ggaaattata 
gaacctactt 
ctagttcccc 
aagccaggat 
gccatattat 

35 atttcctgtt 
gcgatcatag 



cgcctagtgt 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaggcc 



agatcagtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgagatgatt 
tcagctaata 
attgactaat 
gccatacgtg 



aatcagatta 
gttcatcgat 
aaagatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagacatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 12 0 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 



<210> 6 

40 <211> 461 

<212> DNA 

<213> Rhesus Mammary Tumor Virus 



<400> 6 
45 tcccttccct 
ggaaattata 
gaacctactt 
ctagttcccc 
aagccaggat 
50 gccatattat 
atttcctgtt 
gcgatcatag 



cgcctagtgt 
ctccccctgt 
ggttctggga 
atacagaatt 
ttcaagaaca 
taggattacc 
cttcttgtag 
tcaagaggcc 



agatcagtca 
caataaagag 
aaattctcct 
gtttcgctta 
tgagatgatt 
tcagctaata 
accgactaat 
gccatacgtg 



aatcagatta 
gttcatcgat 
aaagatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaagcaaaaa 
ggtatgaagc 
atgatagaga 
caagacatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 
aggatgggta 
ttttactgct 
tattctcaaa 
ttacccttat 
tacttttcat 
cgactatgca 



60 

120 

180 

240 

300 

360 

420 

461 



55 <210> 7 

<211> 461 
<212> DNA 



DCIO: <WO 010082SA3JA> 
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<213> Cat Mammary Tumor Virus 
<400> 7. 



JO 



litiitttz :it:tT t 9 g t xsxs ss? tta aaa *— »«cu rt t 60 

gaacctacat ggttctggj S££Sf S^SS JKXS f??"^ 120 
ctagttcccc atacaoaatt ottrrnpfl ., "ggatccca atgatagaga ttttattgct 180 
aagccaggat ttcaa^ca S£SIE lltltT? Caa * acatct ^attctcaaa 240 
gccatattat taggattacc" tcafcX*^ CCtaCatct9 cctgtgttac ttacccttat 3 00 
atttcctgtt ct?c a tgtaq attof^* * f^" 39 ^ 3 "agaggatc tacttttcat 360 
9cg.tc.Lg t«££2 ££££ f cgactatgca 420 

4 61 



<210> 8 
15 <211> 461 
<212> DNA 

<213> Cat Mammary Tumor Virus 



<400> 8 



20 



25 



SSS a L aa a C a a aga Ca SttcT"* 3aa * Caaaaa ««cUctt 60 

gaacctacat ggttctggga atl^Sr? gttC \ tC9at ^atgaagc aggatgggta 120 
ctagttcccc acacagaalt gtScgctta £K ? -tgatagaga ttttactgct 180 
aagccaggat ttcaagaaca caagacStt Sc2e2c? Caa9aCatCC ^attctcaaa 240 
gccatattat taagattacc tcaf^ffr cctacatctg cctgtgttac ttacccttat 300 
atttcctgtt cttcJtgtag at^ctaat aaa9a " atc tacttttcat 360 

gcgatcatag tcaaga^gc^ gccaSgt" cfgct^t" f^ 9 ^ cgactatgca 42 



0 

461 



30 <210> 9 

<211> 104 
<212> DNA 

<213> Mouse mammary tumor virus 
35 <400> 9 



2sss sssss sssss: ssss :."r c " 9 — » 



40 <210> 10 
<211> 104 
<212> DNA 

<213> Cat Mammary Tumor Virus 
45 <400> 10 



SEES ES= S£~ £ 

50 <210> 11 
<211> 153 
<212> PRT 

<213> Mouse mammary tumor virus 
55 <400> li 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gln Ile Lys Ser Lys 

10 15 



<WO 0100829A3_IA> 
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4- 



Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

5 Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu lie Leu Lys 

70 75 80 

^ Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 



95 



Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp He 
100 105 110 

20 Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 12 5 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie He Val 
25 1 30 "5 14 0 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

30 <210> 12 
<211> 153 
<212> PRT 

<213> Human Mammary Tumor Virus 
35 <400> 12 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 



10 



15 



Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 



35 40 



45 



45 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 

50 70 75 80 

Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 

85 90 95 

Thr Tyr Pro Tyr Ala He. Leu Leu Gly Leu Pro Gin Leu He Asp He 

100 105 no 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg 



Leu 



: <WO 01 00829 A3 J A> 
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-5 



115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie He Val 



5 135 140 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 

10 <210> 13 
<211> 152 
<212> PRT 

<2l3> Human Mammary Tumor Virus 
15 <400> 13 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin He Lys- Asn Lys 
5 10 15 

^ Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

40 45 

25 ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

55 60 
Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
30 # ° 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met lie Pro Thr Ser Ala Cys Val 
85 90 95 

3j Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 

105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Leu Thr 

120 125 

40 .Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val Lys' 



140 



Arg Pro Pro Tyr Val Leu Leu Pro 
"5 150 

45 



<210> 14 
<211> 153 
<212> PRT 
50 <213> Human Mammary Tumor Virus 



<400> 14 

Ser Leu Pro Ser Pro Ser He Glu Gin Ser Asn Gin He Lys Ser Lys 
55 5 10 15 

Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 



25 



30 



301 D: <w O 0100829A3_IA> 
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-6- 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

5 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 80 

10 

Arg Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 
85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
15 100 105 HO 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

20 Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

25 

<210> 15 
<211> 153 
<212> PRT 

30 <213> Rhesus Mammary Tumor Virus 
<400> 15 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asn Gin He Lys Ser Lys 
15 10 15 

35 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 ' 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
40 35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 6 0 

45 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 
85 90 95 

50 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 HO 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
55 115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 



OCID: <WO CnO0629A3_IA> 
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-7- 



130 135 140 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 

5 

<210> 16 
<211> 153 
<212> PRT 

10 <213> Rhesus Mammary Tumor Virus 
<400> 16 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asn Gin He Lys Ser Lys 
15 10 15 

15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
20 3 5 4 0 4 5 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

25 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 
85 90 95 

30 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Pro 
35 115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

40 Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 



<210> 17 

45 <211> 153 

<212> PRT 

<213> Cat Mammary Tumor Virus 

<400> 17 

50 Ser Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin He Lys Ser Lys 
1 5 10 15 

Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 ~ 30 

55 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 
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-8- 



Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 ~ 55 60 

5 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 80 

Lys Pro Gly Phe Gin Glu His Lys Met He Pro Thr Ser Ala Cys Val 
85 - 90 95 

10 

Thr Tyr Pro Tyr Ala He Leu Leu Arg Leu Pro Gin Leu He Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
15 115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
130 135 140 

20 Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 " 150 



<210> 18 
25 <211> 153 
<212> PRT 

<213> Cat Mammary Tumor Virus 
<400> 18 

30 Ser Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin He Lys Ser Lys 
15 10 15 

Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

35 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His, 
40 50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 80 

45 Lys Pro Gly Phe Gin Glu His Lys Met He Pro Thr Ser Ala Cys Val 

85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Arg Leu Pro Gin Leu lie Asp He 
100 105 110 

50 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val 
55 130 135 140 

Lys Arg Pro Pro Tyr Val Leu Leu Pro 



DCID: <WO 0100829A3_IA> 



WO 01/000829 



PCT/USOO/18279 



-9 



145 



150 



<210> 19 
5 <211> 35 
<212> PRT 

<213> Mouse mammary tumor virus 
<400> 19 

10 Met Met Pro Arg Gly Glu Gly Ser Asp He Leu He Lys Gin Leu Ala 
1 5 10 is 

Trp Glu Lys Cys Lys Phe He Val Ser Gly Ser His Pro Pro Asn Thr 
20 25 30 



15 



Lys Asn Arg 
35 



20 <210> 20 
<211> 35 
<212> PRT 

<213> Cat Mammary Tumor Virus 



25 



30 



<400> 20 

Met Met Pro Arg Gly Glu Gly Ser Asp He Leu He Lys Gin Leu Ala 
1 5 10 15 

Tyr Lys Lys Cys Lys Phe He Val Pro Arg Ser Tyr Pro Ser Asn Thr 
20 25 ' 30 

Lys Asn Arg 
35 



35 



40 



45 



50 



<210> 21 
<211> 460 
<212> DNA 

<213> Human Mammary Tumor Virus 
<400> 21, 

tcccttccct cgcctagtgt agatcagtca gatcagatta 
gaaattatac tccccctgtc aataaagagg ttcatcggtg 
aacctacatg gttctgggaa aattctccta aggatcccaa 
tagttcccca tacagaattg tttcgcttag ttgcagcctc 
ggccaggatt tcaaggacat gacatgattc ctacatctgc 
ccatattatt aggattacct cagctaatag atatagaaaa 
tttcctgttc ttcttgtaaa ttgactaatt gtttagattc 
cgatcatagt caagaaggcg ccatatgtgc tgctacctgt 



aaagcaaaag 
gtatgaagca 
tgatagagat 
aagatatctt 
ctgtgttact 
aagaggatct 
ttctgcctac 



gatctatttg 60 
ggatgggtag 120 
tttactgctc 180 
attctcaaaa 240 
tacccttatg 300 
acttttcata 360 
gactatgcag 42 0 
460 



55 



<210> 22 
<211> 38 
<212> PRT 

<213> Human Mammary Tumor Virus 



<400> 22 



<WO 0100829A3_IA> 
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PCT/US00/18279 



- 10- 



Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin lie Lys Ser Lys 
15 10 15 

Arg lie Tyr Leu Glu lie He Leu Pro Leu Ser He Lys Phe He Gly 
5 ~ * 20 25 30 

Gly Met Lys Gin Asp Gly 
35 



10 

<210> 23 
<211> 461 
<212> DNA 

<213> Human Mammary Tumor Virus 

15 

<400> 23 

tcccttccct cgcctagtgt agatcagtca 
ggaaattata ctccccctgt caataaaggg 
gagcctacat ggttctggga aaattctcct 
20 ctagttcccc atacagaatt gtttcgctta 
aggccaggat ttcaaggaca tgacatgatt 
gccatattat taggattacc tcagctaata 
atttcctgtt cttcttgtaa attgactaat 
gcgatcatag tcaagaaggc gccatatgtg 

25 



gatcagatta aaagcaaaaa ggatctattt 6 0 
gttcatcgat ggtatgaagc aggatgggta 120 
aaggatccca atgatagaga ttttactgct 180 
gttgcagcct caagatatct tattctcaaa 240 
cctacatctg cctgtgttac ttacccttat 300 
gatatagaaa aaagaggatc tact tt teat 360 
ttgtttagat cttctgccta egaatatgea 420 
ctgctacctg t 461 



<210> 24 
<211> 153 
<212> PRT 
30 <213> Human Mammary Tumor Virus 



<400> 24 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 
1 5 10 15 

35 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn. 
40 35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 



45 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 
65 70 75 80 

Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 
85 90 95 

50 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro^ Gin Leu He Asp lie 
100 105 HO 



Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
55 115 ^ 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
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11 - 



130 



135 



140 



Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 



10 



15 



20 



<210> 25 
<211> 461 
<212> DNA 

<213> Human Mammary Tumor Virus 
<400> 25 



? 3 ::iti:ti sees iTr tta — *«— ^ tct a ttt 60 

gaacctacat ggttctgggi aaat^ctce! J'" ^ ^tatga.gc aggatgggta 120 
ctagttcccc ILcagKt gtttcgctca iSSSS atgatagaga ttttactgct 180 
aggccaggat ttcaagaaca Saca?«S 9" gca ? cct <=aagatatct tattctcaaa 240 
gccatattat taggaftlcc L? 9 CCtgtgttac "acccctat 300 

atttcctgtt cttcttoraa -f?™ f gatatagaaa aaagaggatc tacttttcat 360 
gcgatcaLg £™ ££££ cgac t a t gca 4 20 



<210> 26 
<211> 153 
25 <212> PRT 

<213> Human Mammary Tumor Virus 



30 



<400> 26 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 



10 15 



Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Gly Val His 

25 30 

35 Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe T rp Glu Asn 

40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
40 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 

75 80 

^ Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 

90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 

105 110 
SO Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 

120 125 
Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
55 135 140 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 



150 
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<210> 27 
<211> 460 
5 <212> DNA 

<213> Human Mammary Tumor Virus 

<400> 27 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaaa ggatctattt 60 

10 ggaaattata ctccccctgt caataaaggg gttcatcgat ggtatgaagc aggatgggta 120 

gagcctacat ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 

ctagttcccc atacagaatt gtttcgctta gttgcagcct caagatatct tattctcaaa 240 

aggccaggat ttcaagaaca tgacatgatt cctacatctg cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaaa aaagaagatc tacttttcat 36 0 

15 atttcctgtt cttcttgtag attgactaat ttgtttagat cttctgccta cgaatatgca 420 

gcgatcatag tcaagaaggg ccatatgtgc tgctacctgt 46 0 

<210> 28 
20 <211> 152 
<212> PRT 

<213> Human Mammary Tumor Virus 
<400> 28 

25 Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 
15 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 



30 



45 



55 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 



Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
35 50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu lie Leu Lys 
65 70 75 80 

40 ' Arg Pro Gly Phe Gin Glu His Asp Met lie Pro Thr Ser Ala Cys Val 

85 90 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
100 105 110 



Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 



Thr Asn Cys Leu Asp Ser Ser Ala Tyr Glu Tyr Ala Ala lie He Val 
50 1 30 1 35 140 

Lys Lys Gly His Met Cys Cys Leu 
145 150 



<210> 29 
<211> 461 
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13 



10 



<212> DNA 

<213> Human Mammary Tumor Virus 



<400> 29 

tcccttccct 

ggaaattata 

gaacctactt 

ctagttcccc 

aagccaggat 

gccatattat 

atttcctgtt 

gcgatcatag 



cgcctagtgt 
ctccccctgt 
gattctggga 
atacagaatt 
ttcaagaaga 
taggattacc 
cttcttgtag 
tcaagaaggc 



agatcagtca 
caataaagag 
aaattctcct 
gttccgctta 
tgacatgatt 
tcagctaata 
attgactaat 
gccatacgtg 



gatcagatta 
gttcatcgat 
aaagatccca 
gttgcagcct 
cctacatctg 
gatatagaga 
tgtttagatt 
ctgctacctg 



aaaacaaaaa 
ggtatgaagc 
atgatagaga 
caagacatct 
cctgtgttac 
aaagaggatc 
cttctgccta 
t 



ggatctattt 60 
aggatgggta 120 
ttttactgct 180 
tattctcaaa 240 
ttacccttat 300 
tacttttcat 360 
cgactatgca 420 
461 
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<210> 30 *" , 

<211> 43 
<212> PRT 

<213> Human Mammary Tumor Virus 
<400> 30 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin He Lys Asn Lys 
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Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 ' 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr 
35 40 
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